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REMARKS 

The foregoing amendments to the claims are of formal nature, and do not add new matter. 
Claims 1-6 were pending in this application and were rejected on various grounds. Claim 1 has 
been amended, and Claim 6 has been cancelled without prejudice or disclaimer for pursuit of 
their subject matter in latter continuation or divisional filings. The rejections to the presently 
pending claims are respectfully traversed. 

The changes made to the Specification and Claims by the current amendment, including 
d e l e tions and additions , are shown herein with deletions designated with a strikethrough and 
additions underlined. 
Priority 

As explained in detail below, Applicants rely on the gene expression data for patentable 
utility. This data was first disclosed in PCT/US00/23328, filed August 24, 2000, on page 93, line 
3. Priority to this application has been properly claimed in the present application. Hence, the 
present application is entitled to at least the priority date of August 24, 2000. 
Specification 

The title of the invention has been amended to better describe the claimed invention. In 
addition, embedded hyperlinks have been removed. 

The PTO has stated that the application contains sequence disclosures that are 
encompassed by the definitions for nucleotide and/or amino acid sequences set forth in 37 C.F.R. 
§ 1.821(a)(1) and (a)(2). The PTO states that the application fails to comply with the 
requirements of 37 C.F.R. § 1.821 through 1.825 because the application does not contain, as a 
separate part of the disclosure on a paper copy, a "Sequence Listing" as required by 37 C.F.R. 
§ 1.821(c). 

Applicants submit herewith a response to the Notice to Comply which amends the 
specification to include a paper copy of the "Sequence Listing," which is also submitted 
herewith. 

Correction of Inventorship under 37 CFR Sl. 48(b) 

Applicant requests that several inventors be deleted, as these inventors' inventions are no 
longer being claimed in the present application as a result of prosecution. The fee as set forth in 
§ 1.1 7(i) is submitted herewith. 
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IDS 

Applicants submit herewith an IDS separately listing references recited in the BLAST 
report in order to be compliant with 37 C.F.R.§ 1.98(a)(1). Consideration of this Information 
Disclosure Statement is respectfully requested. 
Rejection under 35 U.S.C. § 101 and § 112, 1 st Paranraph 

The Examiner rejected Claims 1-6 as lacking specific utility under 35 U.S.C. § 101 
allegedly because the claimed invention is not supported by either a specific and substantial 
asserted utility or a well established utility under 35 U.S.C. § 101. Claims 1-6 are also rejected 
under 35 U.S.C. § 1 12, first paragraph for lack of utility. According to the Examiner, since the 
claimed invention is not supported by either a specific and substantial asserted utility or a well 
established utility, one skilled in the art would not know how to use the claimed invention. 
Applicants respectfully disagree with and traverse these rejections. 

The Examiner asserts that since the protein recited in the claims, PROl 180 (SEQ ED 
NO:28), is not supported by a specific and substantial asserted utility or well-established utility, 
the antibodies to this polypeptide also lack utility. 
Utility - Lezal Standard 

According to the Utility Examination Guidelines ("Utility Guidelines"), 66 Fed. Reg. 
1092 (2001) an invention complies with the utility requirement of 35 U.S.C. § 101, if it has at 
least one asserted "specific, substantial, and credible utility" or a "well-established utility." A 
utility is "specific" when it is particular to the subject matter claimed. For example, it is 
generally not enough to state that a nucleic acid is useful as a diagnostic without also identifying 
the condition that is to be diagnosed. 

The requirement of "substantial utility" defines a "real world" use, and derives from the 
Supreme Court's holding in Brenner v. Manson, 383 U.S. 519, 534 (1966) stating that "The basic 
quid pro quo contemplated by the Constitution and the Congress for granting a patent monopoly 
is the benefit derived by the public from an invention with substantial utility." In explaining the 
"substantial utility" standard, M.P.E.P. § 2107.01 cautions, however, that Office personnel must 
be careful not to interpret the phrase "immediate benefit to the public" or similar formulations 
used in certain court decisions to mean that products or services based on the claimed invention 
must be "currently available" to the public in order to satisfy the utility requirement. "Rather, any 
reasonable use that an applicant has identified for the invention that can be viewed as providing 
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a public benefit should be accepted as sufficient, at least with regard to defining a 'substantial' 
utility." (M.P.E.P. § 2107.01, emphasis added.) 

Indeed, the Guidelines for Examination of Applications for Compliance With the Utility 
Requirement, set forth in M.P.E.P. § 2107 11(B)(1) gives the following instruction to patent 
examiners: "If the applicant has asserted that the claimed invention is useful for any particular 
practical purpose . . . and the assertion would be considered credible by a person of ordinary skill 
in the art, do not impose a rejection based on lack of utility." 

Finally, the Utility Guideline restate the Patent Office's long-established position that any 
asserted utility has to be "credible." "Credibility is assessed from the perspective of one of 
ordinary skill in the art in view of the disclosure and any other evidence of record . . . that is 
probative of the Applicant's assertions." (M.P.E.P. 2107 H(B)(l)(ii)). Such standard is 
presumptively satisfied unless the logic underlying the assertion is seriously flawed, or if the 
facts upon which the assertion is based are inconsistent with the logic underlying the assertion 
(Revised Interim Utility Guidelines Training Materials, 1999). 

The evidentiary standard to be used throughout ex parte examination in setting forth a 
rejection is a preponderance of the totality of the evidence under consideration. In re Oetiker, 
977 F.2d 1443, 1445, 24 USPQ2d 1443, 1444 (Fed. Cir. 1992). Thus, to overcome the 
presumption of truth based on an assertion of utility by the Applicant, the Examiner must 
establish that it is more likely than not that one of ordinary skill in the art would doubt the truth 
of the statement of utility. Absolute predictability is not a requirement. Only after the Examiner 
has made a proper prima facie showing of lack of utility does the burden of rebuttal shift to the 
applicant. The issue will then be decided on the totality of evidence. 

Substantial Utility 

Applicants first address the PTO's argument the evidence of over-expression of the gene 
encoding the PRO 1 180 polypeptide in rectal tumor is insufficient because it does not teach what 
the normal level of expression is, it does not indicate how high the expression level is compared 
to the disease tissue, it lacks statistical correlation, there is no data to compare expression in the 
normal and disease samples, and that because the normal and tumor samples are not from the 
same person, there is no possibility of direct comparison between the normal and melanoma 
samples. Applicants submit that the gene expression data provided in Example 18 of the present 
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application are sufficient to establish a specific and substantial utility for the claimed antibodies 
to the PROl 1 80 polypeptides. 

The gene expression data in the specification, Example 18, shows that the mRNA 
associated with protein PROl 180 was more highly expressed in rectal tumor and normal kidney 
versus normal rectum and kidney tumor. Gene expression was identified using standard semi- 
quantitative PCR amplification reactions with cDNA libraries isolated from different human 
tumor and normal human tissue samples and analyzed by agarose gel electrophoresis so as to 
obtain a semi-quantitative determination of the level of expression of the PRO polypeptide- 
encoding nucleic acid in each reaction. Identification of the differential expression of the PRO 
polypeptide-encoding nucleic acid in one or more tumor tissues as compared to one or more 
normal tissues of the same tissue type rendered the molecule useful diagnostically for the 
determination of the presence or absence of tumor in a subject suspected of possessing a tumor as 
well as therapeutically as a target for the treatment of a tumor in a subject possessing such a 
tumor. Applicants submit herewith as Exhibit A the Declaration of J. Christopher Grimaldi, an 
expert in the field of cancer biology. This declaration was originally submitted in connection 
with co-pending application Serial No. 10/063,557. This declaration explains the importance of 
the data in Example 18, and how differential gene and protein expression studies are used to 
differentiate between normal and tumor tissue (see Declaration, paragraph 7). 

In paragraph 5 of his declaration, Mr. Grimaldi states that the gene expression studies 

reported in Example 18 of the instant application were made from pooled samples of normal and 

of tumor tissues. Contrary to the PTO's assertions that this makes the data unreliable, Mr. 

Grimaldi explains that: 

The DNA libraries used in the gene expression studies were made from pooled 
samples of normal and of tumor tissues. Data from pooled samples is more likely 
to be accurate than data obtained from a sample from a single individual That 
is, the detection of variations in gene expression is likely to represent a more 
generally relevant condition when pooled samples from normal tissues are 
compared with pooled samples from tumors in the same tissue type. 
(Paragraph 5) (emphasis added). 

In paragraphs 6 and 7, Mr. Grimaldi explains that the semi-quantitative analysis 

employed to generate the data of Example 18 is sufficient to determine if a gene is over- or 

under-expressed in tumor cells compared to corresponding normal tissue. He states that any 

visually detectable difference seen between two samples is indicative of at least a two-fold 

-8- 



Appl. No. : 10/063,530 

Filed : May 2, 2002 

difference in cDNA between the tumor tissue and the counterpart normal tissue. He also states 
that the results of the gene expression studies indicate that the genes of interest "can be used to 
differentiate tumor from normal." He explains that, contrary to the PTO's assertions, "The 
precise levels of gene expression are irrelevant; what matters is that there is a relative difference 
in expression between normal tissue and tumor tissue." (Paragraph 7). Thus, since it is the 
relative level of expression between normal tissue and suspected cancerous tissue that is 
important, the precise level of expression in normal tissue is irrelevant. Likewise, there is no 
need for quantitative data to compare the level of expression in normal and tumor tissue. As Mr. 
Grimaldi states, "If a difference is detected, this indicates that the gene and its corresponding 
polypeptide and antibodies against the polypeptide are useful for diagnostic purposes, to screen 
samples to differentiate between normal and tumor." 

The PTO also argues that because cancerous tissue can be aneuploid, and the data in the 
instant application was not corrected for aneuploidy, "a higher amplification of a gene does not 
necessarily mean higher expression or lower in a tissue, but can merely be an indication that the 
cancer tissue is aneuploid." Office Action at 7. The PTO relies on a single reference, Sen, 2000, 
Curr. Opin. Oncol. 12:82-88 (hereinafter Sen). 

Applicants agree that Sen teaches that most cancerous tissues are aneuploid, and that it is 
possible that the results reported in Example 18 may be due to aneuploidy in the tumor cells 
tested. However, Applicants fail to see how it is relevant to the utility of the claimed 
polypeptides, or their corresponding antibodies, whether the over and under-expression reported 
in Example 18 is due to aneuploidy or not. Regardless of whether the over- and under- 
expression of the gene encoding PRO1180 is a result of increased or decreased transcription of 
the gene, aneuploidy, or some other regulatory mechanism, the fact remains that it is over and 
underexpressed in rectal tumor and kidney tumor as compared to normal tissue, and it is 
therefore useful as a diagnostic tool for cancer since it can be used as a molecular marker for 
cancer. 

Applicants have established that the accepted understanding in the art is that there is a direct 
correlation between mRNA levels and the level of expression of the encoded protein 

The PTO argues that there is no supporting evidence that the PRO1180 polypeptide 
invention is over and under expressed in the tumor tissue compared to the normal tissue. The 
PTO also states that the literature reports that it does not necessarily follow that an increase in 
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gene copy number results in increased gene expression and increased polypeptide expression. 
Relying on Pennica et aL, 1998, PNAS USA 95:14717-14722 (hereinafter Pennica), the PTO 
states that one cannot extrapolate the expression data provided in the specification to support the 
implicit assertion that the PROl 180 polypeptides and antibodies can be used in cancer diagnosis 
or therapy. 

Applicants respectfully submit that the PTO is confusing the relationship between an 
increase in copy number of a gene or gene amplification on the one hand, and increased 
expression of a gene or mRNA expression on the other. The PTO focuses on the statement from 
Pennica that the WISP-2 gene DNA was amplified in colon tumors, but its mRNA expression 
was significantly reduced in the majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. Office Action at 7. As an aside, it should be noted that 
this result may not even be real, as the authors explain: "Because the center of the 20ql3 
amplicon [of which WISP-2 is a part] has not yet been identified, it is possible that the apparent 
amplification observed for WISP-2 may be caused by another gene in this amplicon." Pennica at 
14722 (emphasis added). 

However, even if the lack of correlation between DNA copy number and mRNA level in 
Pennica is real, Pennica says nothing about a lack of correlation between the level of mRNA and 
the level of protein expression - Pennica did not even look at protein expression. It is the 
correlation between mRNA level, as assessed by probing the cDNA library, and the level of 
protein expression which is at issue here, not the correlation of gene copy number and mRNA 
levels. The data Applicants report in Example 18 indicates that there are more copies of the 
mRNA encoding PROl 180 in rectal tumor and normal kidney than in normal rectum and kidney 
tumor. Nothing in Pennica is contrary to Applicants' assertion that it is well-established in the 
art that the level of protein is positively correlated to the level of mRNA. 

As stated above, the standard for utility is not absolute certainty, but rather whether one 
of skill in the art would be more likely than not to believe the asserted utility. Even if Pennica 
supported the PTO's argument, which it does not, one contrary example does not establish that 
one of skill in the art would find it is more likely than not, that in general, there is no correlation 
between mRNA level and protein levels. In fact, the working hypothesis among those skilled in 
the art is that there is a direct correlation between mRNA levels and protein levels. 



-10- 



Appl. No. : 10/063,530 

Filed : May 2, 2002 



Applicants further submit that it is generally well-understood in the art that in the 
majority of cases, gene expression correlates with levels of protein expression . In support of 
Applicants' position, Applicants submit herewith as Exhibit B a second Declaration by J. 
Christopher Grimaldi. This declaration was originally submitted in connection with co-pending 
application Serial No. 10/006,867. As stated in paragraph 5 of the declaration, "Those who 
work in this field are well aware that in the vast majority of cases, when a gene is over- 
expressed. . .the gene product or polypeptide will also be overexpressed." The references cited in 
the declaration and submitted herewith support this statement. 

Scientists regularly rely on the results of gene expression and even gene amplification 
results to point the way to differential protein expression in disease and, in this case, cancer. 
Submitted herewith as Exhibit C is the declaration of Dr. Paul Polakis, principal investigator of 
the Tumor Antigen Project of Genentech, Inc., the assignee of the present application. As Dr. 
Polakis explains, the primary focus of the microarray project was to identify tumor cell markers 
useful as targets for both the diagnosis and treatment of cancer in humans. The scientists 
working on the project extensively rely on results of microarray experiments in their effort to 
identify such markers. As Dr. Polakis explains, using microarray analysis, Genentech scientists 
have identified approximately 200 gene transcripts (mRNAs) that are present in human tumor 
cells at significantly higher levels than in corresponding normal human cells. To date, they have 
generated antibodies that bind to about 30 of the tumor antigen proteins expressed from these 
differentially expressed gene transcripts and have used these antibodies to quantitatively 
determine the level of production of these tumor antigen proteins in both human cancer cells and 
corresponding normal cells. Having compared the levels of mRNA and protein in both the tumor 
and normal cells analyzed, they found a very good correlation between mRNA and corresponding 
protein levels. Specifically, in approximately 80% of their observations they have found that 
increases in the level of a particular mRNA correlates with changes in the level of protein 
expressed from that mRNA . While the proper legal standard is to show that the existence of 
correlation between mRNA and polypeptide levels is more likely than not, the showing of 
approximately 80% correlation for the molecules tested in the Polakis Declaration greatly 
exceeds this legal standard. Based on these experimental data and his vast scientific experience 
of more than 20 years, Dr. Polakis states that, for human genes, increased mRNA levels typically 
correlate with an increase in abundance of the encoded protein. He further confirms that "it 
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remains a central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein." 

Additional references support this position. For example, Orntoft et al. (submitted 
herewith as Exhibit D) studied transcript levels of 5600 genes in malignant bladder cancers 
which were linked to a gain/loss of chromosomal material using an array-based method. Orntoft 
et al. showed that there was a gene dosage effect and teach that "in general (18 of 23 cases) 
chromosomal areas with more than 2-fold gain of DNA showed a corresponding increase in 
mRNA transcripts" (see column 1, abstract). In addition, Hyman et al (submitted herewith as 
Exhibit E) showed, using CGH analysis and cDNA microarrays to compare DNA copy numbers 
and mRNA expression of over 12,000 genes in breast cancer tumors and cell lines, that there is 
"evidence of a prominent global influence of copy number changes on gene expression levels" 
(see page 6244, column 1, last paragraph). Additional supportive teachings are also provided by 
Pollack et al. (submitted herewith as Exhibit F) who studied a series of primary human breast 
tumors and found that "...62% of highly amplified genes show moderately or highly elevated 
expression, that DNA copy number influences gene expression across a wide range of DNA copy 
number alterations (deletion, low-, mid- and high-level amplification), that on average, a 2-fold 
change in DNA copy number is associated with a corresponding 1.5-fold change in mRNA 
levels" (see column 1, abstract). Thus, these articles collectively teach that in general, gene 
expression correspondingly influences mRNA expression. 

Taken together, despite some teachings in the art of certain genes that do not fit within 
this paradigm which are exceptions rather than the rule, in the vast majority of cases , the 
combined teachings in the art, exemplified by Orntoft et al., Hyman et al. and Pollack et al. and 
the Grimaldi and Polakis declarations, overwhelmingly teach that gene expression influences 
protein levels. Thus, one of skill in the art would reasonably expect, in this instance, based on 
the gene expression data for the PRO 1 180 gene, that the PRO 1 180 protein is concomitantly over- 
expressed in normal kidney and rectal tumor. Thus, Applicants submit that the PRO 1 180 protein 
and antibodies against it have utility in the diagnosis of cancer and based on such a utility, one of 
skill in the art would know exactly how to use these molecules. 
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Claimed antibodies would have diagnostic utility even if the protein were not over- 
expressed 

Even assuming arguendo that, there is no correlation between gene expression and 

increased or decreased protein expression for PRO 1 180, which Applicants submit is not true, the 

polypeptide encoded by a gene that is over-expressed in cancer would still have a credible, 

specific and substantial utility. In support, Applicants submit herewith as Exhibit G the 

Declaration of Avi Ashkenazi, Ph.D., an expert in the field of cancer biology. Dr. Ashkenazi's 

Declaration explains that: 

even when amplification of a cancer marker gene does not result in significant 
over-expression of the corresponding gene product, this very absence of gene 
product over-expression still provides significant information for cancer diagnosis 
and treatment. Thus, if over-expression of the gene product does not parallel gene 
amplification in certain tumor types but does so in others, then parallel monitoring 
of gene amplification and gene product over-expression enables more accurate 
tumor classification and hence better determination of suitable therapy. In 
addition, absence of over-expression is crucial information for the practicing 
clinician. If a gene is amplified but the corresponding gene product is not over- 
expressed, the clinician accordingly will decide not to treat a patient with agents 
that target that gene product. 

Applicants submit that simultaneous testing of gene expression (or gene amplification) 
and gene product over-expression or under-expression enables more accurate tumor 
classification, even if the gene-product, the protein, is not over- or under-expressed. This leads 
to better determination of a suitable therapy. Further, as explained in Dr. Ashkenazi's 
Declaration, absence of over-expression of the protein itself is crucial information for the 
practicing clinician. If a gene is over-expressed in a tumor, but the corresponding gene product is 
not over-expressed, the clinician need not treat a patient with agents that target that gene product. 
This not only saves money, but further prevents unnecessary exposure of the patient to the side 
effects of gene product targeted agents. 

This is further supported by the teachings in the article by Hanna and Mornin, submitted 
herewith as Exhibit H. The article teaches that the HER-2/neu gene has been shown to be 
amplified and/or over-expressed in 10%-30% of invasive breast cancers and in 40-60% of 
intraductal breast carcinoma. Further, the article teaches that diagnosis of breast cancer includes 
testing both the amplification of the HER-2/neu gene (by FISH) as well as the over-expression of 
the HER-2/neu gene product (by D3C). Even when the protein is not over-expressed, the assay 
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relying on both tests leads to a more accurate classification of the cancer and a more effective 
treatment of it. 

The art indicates that, if a gene is over-expressed in cancer, it is more likely than not that 
the encoded protein will also be expressed at an elevated level. Even if not over-expressed, a 
polypeptide encoded by a gene that is over-expressed in cancer would still have utility. Thus, 
Applicants have demonstrated a credible, specific and substantial asserted utility for the PRO 
1180 polypeptide and antibodies against the PRO 1180 polypeptide. Based on the evidence and 
arguments presented herein, one skilled in the art, at the time the application was filed, would 
know how to use the claimed antibodies. 

Even if a prima facie case of lack of utility had been established, it should be withdrawn 
on consideration of the totality of evidence 

Applicants have provided several expert opinions and references supporting the utility of 
the present invention. Applicants submit that one of ordinary skill in the art would have no 
legitimate basis to doubt the credibility of the statements made by Mr. Grimaldi, and Dr. Polakis 
and Dr. Ashkenazi, and must treat as true the statements made by these experts. Applicant 
reminds the Examiner that "Office personnel must accept an opinion from a qualified expert that 
is based upon relevant facts whose accuracy is not being questioned; it is improper to disregard 
the opinion solely because of a disagreement over the significance or meaning of the facts 
offered." PTO Utility Examination Guidelines (2001). 

Given the totality of the evidence provided, Applicants submit that they have established 
a specific and substantial credible utility for the claimed proteins as diagnostic agents. 
According to the PTO Utility Examination Guidelines (2001), irrefutable proof of a claimed 
utility is not required. Rather, a specific and substantial credible utility requires only a 
"reasonable" confirmation of a real world context of use. Applicants submit that they have 
established that it is more likely than not that one of skill in the art would reasonably accept the 
utility for the PRO1180 polypeptide set forth in the specification. In view of the above 
arguments, Applicants respectfully request that the PTO reconsider and withdraw the utility 
rejections under 35 U.S.C. §101 and §112, first paragraph. 
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35 U.S.C. §112, second paragraph 

The Examiner rejected Claims 1-6 as indefinite for the recitation of "an antibody that 
binds" and an "antibody that specifically binds". Claim 6 has been cancelled and Claim 1 
amended to recite "specifically binds". Applicants submit that the term "specifically binds" has a 
well established meaning and is understood by those skilled in the art to mean that the antibody 
binds to a particular polypeptide, and does not significantly bind to any other polypeptide. Since 
claim terms should be given their ordinary, art-recognized meaning, the present rejection is 
believed to be misplaced, and should be withdrawn. 
Rejection under 35 U.S.C. $1 02(b) 

The Examiner rejected Claims 1, 2, 5 and 6 under U.S.C. § 102(b) as anticipated by 
Edwards et al. U.S. Patent No. 6,222,029, (issued April 2001) ('029). 

Applicants have established that they are entitled to the priority date of PCT/US00/23328, 
filed August 24, 2000 for the instantly claimed invention. The April 2001 issue date of the '029 
patent is not more than one year prior to the August 24, 2000 priority date of the present 
application. Accordingly, the '029 patent is not prior art under § 102(b), and the rejection should 
be withdrawn. 

Rejection under 35 U.S.C. §102(e) 

The Examiner rejected Claims 1-6 under U.S.C. § 102(e) as anticipated by Edwards et al. 
U.S. Patent No. 6,639,063 (filed July 21, 2000, issued October 28, 2003) ('063). 

The PTO states that SEQ ID NO:4227 described in the '063 patent has a local similarity 
of about 61% over the first 158 amino acids of SEQ ED NO:28. The overall similarity between 
the sequences, however, is only about 22%. 

The claims are directed to antibodies which specifically bind to the polypeptide of SEQ 
ED NO:28. As discussed above, the term "specifically binds" has a well established meaning and 
is understood by those skilled in the art to mean that the antibody binds to a particular 
polypeptide, and does not significantly bind to any other polypeptide. This, then means that the 
claimed antibodies bind to the polypeptide of SEQ ID NO:28 and not substantially to any 
polypeptide which is not SEQ ID NO:28. 
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The claimed antibodies which specifically bind to SEQ ID NO:28 would not substantially 
bind to SEQ ID NO:4227 described in the '063 patent. Thus, the '063 patent can not anticipate 
the claimed antibodies because every limitation of the claim is not taught in the '063 patent. 

Applicants therefore request that the rejection under § 102(e) be withdrawn. 

Rejection under 35 ILS.C. §103(a) 

Claims 3 and 4 are rejected as unpatentable over Edwards et al. '029 in view of Mack et 

al. U.S. Patent 6,294,343 (issued September 2001) ('343) 

As noted above, Applicants are entitled to priority to PCT/US00/23328, filed August 24, 
2000 for the instantly claimed invention. The April 2001 issue date of the '029 patent is not 
more than one year prior to the August 24, 2000 priority date of the present application. 
Accordingly, the '029 patent is not available as prior art under § 103(a). 

Applicants therefore request that the rejections under 35 U.S.C §103 (a) be withdrawn. 



The present application is believed to be in prima facie condition for allowance, and an 
early action to that effect is respectfully solicited. Applicants invite the Examiner to call the 
undersigned if any issues may be resolved through a telephonic conversation. 

Please charge any additional fees, including any fees for additional extension of time, or 
credit overpayment to Deposit Account No. 11-1410. 



Conclusion 



Respectfully submitted, 




KNOBBE, MARTENS, OLSON & BEAR, LLP 



Registration No. '48,868 
Agent of Record 
Customer No. 30,313 
(415) 954-4114 
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DECLARATION OF .L rijnKlgTnPHER GR1MA1 JPL UNDER 37 CFR y .m 

Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA 22313-1450 

Dear Sin 

I, J. Christopher Grimaldi, declare and state as follows; 



1. 1 am a Senior Research Associate in the Molecular Biology Department of 
Gcncntcch, Inc., South San Francisco, CA 94080. 

2. My scientific Curriculum Vitae, including my list of publications, is attached to 
and fonns part of this Declaration (Exhibit A). 

3. I joined Gcnentech in Jamuny of 1999. From 1999 to 2003, 1 directed the Cloning 
Laboratory in the Molecular Biology Department. During this time 1 directed or performed 
numerous molecular biology techniques including semi-quantitative Polymerase Chain Reaction 
(PCR) analyses. T am currently involved, among other projects, in the isolation of genes coding 
for membrane associated proteins which can be used as targets for antibody therapeutics against 
cancer. In connection with the above-identified patent application, I personally performed or 
directed the semi-quantitative PCR gene expression analyses in the assay entitled "Tumor Versus 
Normal Differential Tissue Expression Distribution," which is described in EXAMPLE 18 in. the 
specification. These studies were used to identify differences in gene expression between tumor 
tissue and their normal counterparts. 

4. EXAMPLE 18 reports the results of the PCR analyses conducted as part of the 
investigating of several newly discovered DNfA sequences. This process included developing 
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primers and analyzing expression of the DNA sequences of interest in normal and tumor tissues. 
The analyses were designed to determine whether a difference exists between gene expression in 
normal tissues as compared to tumor in the same tissue type. 

5. The DNA libraries used -in the gene expression studies were made from pooled 
samples of normal and of tumor tissues. Data from pooled samples is more likely to be accurate 
than data obtained from a sample from a single individual. That is, the detection of variations in 
gene expression is likely to represent a more generally relevant condition when pooled samples 
from normal tissues are compared with pooled samples from tumors in the same tissue type. 

6. Tn differential gene expression studies, one looks for genes whose expression levels 
differ significantly under different conditions, for example, in normal versus diseased tissue. 
Thus, I conducted a semi-quantitative analysis of the expression of the DNA sequences of 
interest in normal versus lumor tissues. Expression levels were graded according to a scale of - 
, and +/- to indicate the amount of the specific signal detected. Using the widely accepted 
technique of PCR f it was determined whether the polynucleotides tested were more highly 
expressed, less expressed, or whether expression remained the same in tumor tissue as compared 
to its normal counterpart Because this technique relics on the visual detection of cthidium 
bromide staining of PGR products on agarose gels, it is reasonable to assume that any detectable 
differences seen between two samples will represent at least a two fold difference in cDNA. 

7. The results of the gene expression studies indicate that the genes of interest can be 
used to differentiate tumor from normal. The precise levels of gene expression arc irrelevant; 
what matters is that there is a relative difference in expression between normal tissue and tumor 
tissue. The precise type of tumor is also irrelevant; again, the assay was designed to indicate 
whether a difference exists between normal tissue and tumor tissue of the same type. If a 
difference is detected, this indicates that the gene and its corresponding polypeptide and 
antibodies against the polypeptide are useful for diagnostic purposes, to screen samples to 
differentiate between normal and tumor. Additional studies can then be conducted if further 
information is desired. 

8. I hereby declare that all statements made herein of my own knowledge arc true and 
that all statements made on information or belief arc believed to be true, and further that these 
statements were made with the knowledge that willful false statements and the like so made arc 
punishable by fine or imprisonment, or both, under Section 1 001 of Title 1 8 of the United States 
Code and that such willful statements may jeopardize the validity of the application or any 
patent issued thereon. 

Bv: li . W 

Christopher Grimaldi 
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Commissioner for Patents 

P.O. Box 1450 

Alexandria, VA 22313-1450 

I, J. Christopher Grimaldi, declare and say as follows: 

1. I am a Senior Research Associate in the Molecular Biology Department of 
Genentech, Inc., South San Francisco, CA 94080. 

2. I joined Genentech in January of 1999. From 1999 to 2003, 1 directed the Cloning 
Laboratory in the Molecular Biology Department. During this time I directed or performed 
numerous molecular biology techniques including qualitative Polymerase Chain Reaction (PCR) 
analyses. I am currently involved, among other projects, in the isolation of genes coding for 
membrane associated proteins which can be used as targets for antibody therapeutics against 
cancer. In connection with the above-identified patent application, I personally performed or 
directed the qualitative PCR analyses in the assay entitled "Tumor Versus Normal Differential 
Tissue Expression Distribution" which is described in EXAMPLE 18 in the specification that 
were used to identify differences in gene expression between tumor tissue and their normal 
counterparts. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to 
and forms part of this Declaration (Exhibit A). 

4. Chromosomal aberrations, such as gene amplification, and chromosomal 
translocations are important markers of specific types of cancer and lead to the aberrant 
expression of specific genes and their encoded polypeptides. Gene amplification is a process in 
which specific regions of a chromosome are duplicated, thus creating multiple copies of certain 
genes that normally exist as a single copy. In addition, chromosomal translocations occur when 



AppLNo. : 10/006867 

Filed : December 6, 2001 

two different chromosomes break and are rejoined to each other chromosome resulting in a 
chimeric chromosome which displays a different expression pattern relative to the parent 
chromosomes. Amplification of certain genes such as Her2/Neu [Singleton et aL Pathol Annu,, 
27Ptl: 165-190], or chromosomal translocations such as t(5;14), [Grimaldi et aL, Blood . 
73(8):208 1-2085(1989); Meeker et aL, Blood , 76(2):285-289(1990)] give cancer cells a growth 
or survival advantage relative to normal cells, and might also provide a mechanism of tumor cell 
resistance to chemotherapy or radiotherapy. If the chromosomal aberration results in the aberrant 
expression of a mRNA and the corresponding gene product (the polypeptide), as they do in the 
aforementioned cases, then the gene product is a promising target for cancer therapy, for 
example, by the therapeutic antibody approach. 

5. Those who work in this field are well aware that in the vast majority of cases, 
when a gene is over-expressed, as evidenced by an increased production of mRNA, the gene 
product or polypeptide will also be over-expressed. It is unlikely that one identifies increased 
mRNA expression without associated increased protein expression. Stated in another way, two 
cell samples which have differing mRNA concentrations for a specific gene are expected to have 
correspondingly different concentration of protein for that gene. Techniques used to detect 
mRNA, such as Northern Blotting, Differential Display, in situ hybridization, quantitative PCR, 
Taqman, and more recently Microarray technology all rely on the dogma that a change in mRNA 
will represent a similar change in protein. If this dogma did not hold true then these techniques 
would have little value and not be so widely used. The use of mRNA quantitation techniques 
have identified a seemingly endless number of genes which are differentially expressed in 
various tissues and these genes have subsequently been shown to have correspondingly similar 
changes in their protein levels. Thus, the detection of increased mRNA expression is expected to 
result in increased polypeptide expression. The detection of increased polypeptide expression 
can be used for cancer diagnosis and treatment. 

6. However, even in the rare case where the protein expression does not correlate with the 
mRNA expression, this still provides significant information useful for cancer diagnosis and 
treatment. For example, if over-expression of a gene product does not correlate with over- 
expression of mRNA in certain tumor types but does so in others, then identification of both gene 
expression and protein expression enables more accurate tumor classification and hence better 
determination of suitable therapy. In addition, absence of over-expression of the gene product in 
the presence of a particular over-expression of mRNA is crucial information for the practicing 
clinician. If a gene is over-expressed but the corresponding gene product is not significantly 
over-expressed, the clinician accordingly will decide not to treat a patient with agents that target 
that gene product. 

7. I hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information or belief are believed to be true, and further that 
these statements were made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United 
States Code and that such willful statements may jeopardize the validity of the application or any 
patent issued thereon. 
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University of California, Berkeley 
Bachelor of Arts in Molecular Biology, 1984 



EMPLOYMENT EXPERIENCE 



SRA 



Genentech Inc., South San Francisco; 1/99 to present 



Previously, was responsible to direct and manage the Cloning Lab. Currently focused on 
isolating cancer specific genes for the Tumor Antigen (TAP), and Secreted Tumor Protein 
(STOP) projects for the Oncology Department as well as Immunologically relevant genes for the 
Immunology Department. Directed a lab of 6 scientists focused on a company-wide team effort 
to identify and isolate secreted proteins for potential therapeutic use (SPDI). For the SPDI project 
my duties were, among other things, the critically important coordination of the cloning of 
thousands of putative genes, by developing a smooth process of communication between the 
Bioinformatics, Cloning, Sequencing, and Legal teams. Collaborated with several groups to 
discover novel genes through the Curagen project, a unique differential display methodology. 
Interacted extensively with the Legal team providing essential data needed for filing patents on 
novel genes discovered through the SPDI, TAP and Curagen projects. My group has developed, 
implemented and patented high throughput cloning methodologies that have proven to be 
essential for the isolation of hundreds of novel genes for the SPDI, TAP and Curagen projects as 
well as dozens of other smaller projects. 



Scientist DNAX Research Institute, Palo Alto; 9/91 to 1/99 

Involved in multiple projects aimed at understanding novel genes discovered through 
bioinformatics studies and functional assays. Developed and patented a method for the specific 
depletion of eosinophils in vivo using monoclonal antibodies. Developed and implemented 
essential technical methodologies and provided strategic direction in the areas of expression, 
cloning, protein purification, general molecular biology, and monoclonal antibody production. 
Trained and supervised numerous technical staff. 



Directed plant-related activities, which included expansion planning, maintenance, safety, 
purchasing, inventory control, shipping and receiving, and laboratory management. Designed 
and implemented the safety program. Also served as liaison to regulatory agencies at the local, 
state and federal level. Was in charge of property leases, leasehold improvements, etc. 
Negotiated vendor contracts and directed the purchasing department. Trained and supervised 
personnel to carry out the above-mentioned duties. 
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The t(5;14) Chromosomal Translocation in a Case of Acute Lymphocytic 
Leukemia Joins the Interleukin-3 Gene to the Immunoglobulin Heavy Chain Gene 

By J. Christopher Grimaldi and Timothy C. Meeker 



Chromosomal translocations have proven to be important 
markers of the genetic abnormalities central to the patho- 
genesis of cancer. By cloning chromosomal breakpoints 
one can identify activated proto-oncogenes. We have stud- 
ied a case of B-lineage acute lymphocytic leukemia (ALL) 
that was associated with peripheral blood eosinophilia. The 
chromosomal translocation t(5;14) <q31;q32) from this 
sample was cloned and studied at the molecular level. This 

KARYOTYPIC STUDIES of leukemia and lymphoma 
have identified frequent nonrandom chromosomal 
translocations. Some of these translocations juxtapose the 
immunoglobulin heavy chain (IgH) gene with important 
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Fig 1. DMA blots of the leukemia sample. The restriction 
fragment pattern of normal human DNA (N) and the leukemia 
sample (U were compared using a human Jh probe. Rearranged 
bands are indicated by arrows. Sample L exhibits a single rear- 
ranged band with both Hind IH/EcoRI and SauZA restriction 
digests. The rearranged bands are less intense than the other 
bands because the majority of cells in the sample represent normal 
bone marrow elements. 



translocation joined the immunoglobulin heavy chain join- 
ing Uh) region to the promotor region of the interleukin-3 
(IL-3) gene in opposite transcriptional orientations. The 
data suggest that activation of the IL-3 gene by the 
enhancer of the immunoglobulin heavy chain gene may play 
a central role in the pathogenesis of this leukemia and the 
associated eosinophilia. 
e 1989 by Gruna & Stratton, Inc. 

protooncogenes, such as c-myc and oc/-2. u In this way, the 
IgH gene can activate proto-oncogenes, resulting in disor- 
dered gene expression and a step in the development of 
cancer. The investigation of additional nonrandom transloca- 
tions into the IgH locus allows us to identify new genes 
promoting the generation of leukemia and lymphoma. 

A distinct subtype of acute lymphocytic leukemia (ALL) 
has been characterized by B-lineage phenotype, associated 
eosinophilia in the peripheral blood, and a t(5;14)(q31;q32) 
chromosomal translocation. 3 - 4 This syndrome probably 
occurs in <1% of all patients with ALL. We hypothesized 
that the cloning of the translocation characteristic of this 
leukemia might allow the identification of an important gene 
on chromosome 5 that plays a role in the evolution of this 
disease. In this report we demonstrate that the interleukin-3 
gene (IL-3) and the IgH gene are joined by this transloca- 
tion. 

MATERIALS AND METHODS 

Sample and DNA blots, A bone marrow aspirate from a repre- 
sentative patient with ALL (LI morphology by French-American- 
British [FAB] criteria), peripheral eosinophilia (up to 20.000 per 
microliter with a normal value of <350 per microliter) and a 
t(5;14)(q3l;q32) translocation was studied. Using published meth- 
ods, genomic DNA was isolated and DNA blots were made. 5 Briefly, 
10 jig of high molecular weight (mol wt) DNA were digested using 
an appropriate restriction enzyme and electrophoresed on a 0.8% 
agarose gel. The gel was stained with ethidtum bromide, photo- 
graphed, denatured, neutralized, and transferred to Hybond ( Amer- 
sham, Arlington Heights, IL). After treatment of the filter with 
ultraviolet light, hybridization was performed. The filter was washed 
to a final stringency of 0.2% saturated sodium citrate (SSC) and 
0.1% sodium lauryly sulfate (SDS) and exposed to film. The human 
Jh probe has been previously reported. 6 

Genomic library. The genomic library was made using pub- 
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Activation of the InterIeukin-3 Gene by Chromosome Translocation in Acute 
Lymphocytic Leukemia With Eosinophilia 

By Timothy C. Meeker, Dan Hardy, Cheryl Willman, Thomas Hogan, and John Abrams 



The t{5;1 4){q31 ;q32) translocation from B-Hneage acute 
lymphocytic leukemia with eosinophilia has been cloned 
from two leukemia samples. In both cases, this transloca- 
tion joined the IgH gene end the interleukin-3 (IL-3) gene. In 
one patient, excess IL-3 mRNA was produced by the 
leukemic cells. In the second patient, serum IL-3 levels 
were measured and shown to correlate with disease 

A NUMBER OF chromosome translocations have been 
associated with human leukemia and lymphoma. In 
many cases the study of these translocations has led to the 
discovery or characterization of proto-oncogenes, such as 
bcU2 t c-abl t and c-myc, that are located adjacent to the 
translocation.'- 2 It is now widely understood that cancer- 
associated translocations disrupt nearby proto-oncogenes. 

A distinct subtype of acute leukemia is characterized by 
the triad of B-lineage immunophenotype, eosinophilia, and 
the t(5;14)(q31;q32) translocation. 1 - 4 Leukemic cells from 
such patients have been positive for terminal deoxynucleotidyl 
transferase (Tdt), common acute lymphoblastic leukemia 
antigen (CALLA), and CD 19, but negative for surface or 
cytoplasmic immunoglobulin. In previous work, we cloned 
the t(5;14) breakpoint from one leukemic sample (Case 1) 
and determined that the IgH and interleukin-3 (IL-3) genes 
were joined by this abnormality.* In this report, we extend 
those findings by showing that the t(5;14)(q31;q32) translo- 
cation from a second leukemia sample (Case 2) has a similar 
structure, and we report our study of growth factor expres- 
sion in these patients. 

MATERIALS AND METHODS 

Samples and Southern blots. Case 1 has been described, 5 * 6 
Clinical features of Case 2 have been described in detail. 3 DNA 
isolation and Southern blotting was done using previously described 
methods. 9 Filters were hybridized with an immunoglobulin Jh probe, 
a 280 bp BamHl/EcoRl genomic IL-3 fragment, and an IL-3 
cDNA probe. 7 - 1 

Northern blots. RNA isolation and Northern blotting have been 
described. 9 Briefly, Northern blots were done by separating 9^g 
total RNA on 1% agarose-formaldehyde gels. Equal RNA loading in 
each lane was confirmed by ethidium bromide staining. Blots were 
hybridized with an IL-3 cDNA probe extending to the Xho I site in 
exon 5, a 720 bp Sst 1/ Kpn I probe derived from intron 2 of the IL-3 
gene, a 600 bp Nhe l/Hpa I IL-5 cDNA probe, and a 500 bp Pst 
l/Nco I granulocyte-macrophage colony stimulating factor (GM- 
CSF) cDNA probe. 10 -' 2 

Polymerase chain reaction. Primers were designed with BamHl 
sites for cloning. One primer hybridized to the Jb sequences from the 
IgH gene (Primer 1 44:5'-TAGG ATCCGACGGTGACCAGGGT), 
and the other hybridized to the region of the TATA box in the IL-3 
gene (Primer 1 61: 5-AACAGGATCCCGCCTTATATGTGCAG). 
Polymerase chain reaction (PCR) (95°C for I ininute, 61°C for 30 
seconds, and 72°C for 3 minutes) was done using 500 ng genomic 
DNA and 50 pmol of each primer in 100 /iL containing 67 mmol/L 
Tris-HCl pH 8.8, 6.7 mmol/L MgC] a , 10% dimethyl sulfoxide 
(DMSO), 170 jig/mL bovine serum albumin (BSA) (fraction V), 



activity. There was no evidence of excess granulocyte/ 
macrophage colony stimulating factor (GM-CSF) or IL-5 
expression. Our data support the formulation that this 
subtype of leukemia may arise in part because of a 
chromosome translocation that activates the IL-3 gene, 
resulting in autocrine and paracrine growth effects. 
© 1990 by The American Society of Hematology. 

16.6 mmol/L ammonium sulfate, 1.5 mmol/L each dNTP and Taq 
polymerase (Perkin-Elmer, Norwalk, CT).° 

Sequencing. Sequencing was done by chain termination in Ml 3 
vectors. 14 As part of this study, we sequenced a subclone of a normal 
IL-3 promotor, covering 598 base pairs from a Sma I site at position 
- 1240 (with respect to the proposed site of transcription initiation) 
to an Nhe I site at position - 642. The plasmid containing this region 
was a gift from Naoko Arai of the DNAX Research Institute. 

Expression in Cos7 cells. A genomic IL-3 fragment from Case 1 
was cloned into the pXM expression vector. 10 Briefly, the Hindlll/ 
Sal I fragment containing the IL-3 gene was subcioned from the 
previously described phage clone 4 into pUC18. J The 2.6 kb 
fragment extending from the Sma I site 61 bp upstream of the IL-3 
transcription start to the Sma I site in the polylinker was cloned into 
the blunted Xho I site of pXM. The negative control construct was 
the pXM vector without insert. Plasmids were introduced into Cos7 
cells by electroporation, and supernatant was collected after 48 
hours in culture. 

TFl btoassay. TF-1 cells were passaged in RPMI 1 640 supple- 
mented with 10% heat-inactivated fetal bovine scrum, 2 mmol 
L-giutamine, and 1 ng/mL human GM-CSF." Samples and antibod- 
ies were diluted in this same medium lacking GM-CSF but contain- 
ing penicillin and streptomycin. A25pL volume of serial dilutions of 
patient serum was added to wells in a flat bottom 96-well microtiter 
plate. Rat anti-cytoldne monoclonal antibody in a volume of 25 pL 
was added to appropriate wells and preincu bated for 1 hour at 37°C 
Fifty microliters of twice washed TF-1 cells were added to each well* 
giving a final cell concentration of 1 x 10 4 cells per well (final 
volume, 100 pL). The plate was incubated for 48 hours. The 
remaining cell viability was determined metabolically by the colori- 
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Clinical and Pathologic Significance of the 
c-erbB-2 (HER-2/neu) Oncogene 

Timothy R Singleton and John G. Strickler 



The c-erbB-2 oncogene was first shown to have clinical significance in 1987 by 
Slamon et al, 79 who reported that c-erfeB-2 DNA amplification in breast carcino- 
mas correlated with decreased survival m patients with metastasis to axillary 
lymph nodes. Subsequent studies, however, of oer&B-2 activation in breast 
carcinoma reached conflicting conclusions about its clinical significance. This 
oncogene also has been reported to have clinical and pathologic implications in 
other neoplasms. Our review summarizes these various studies and examines 
the clinical relevance of c-eriB-2 activation, which has not been emphasized in 
recent reviews. 37 ^ The molecular biology of the c-eriB-2 oncogene has been 
extensively reviewed 37 39 * 55 and will be discussed only briefly here. 



BACKGROUND 



I The c-erfoB-2 oncogene was discovered in the 1980s by three lines of mvestiga- 

tion. The neu oncogene was detected as a mutated transforming gene in 
neuroblastomas induced by ethylnitrosurea treatment of fetal rats. 6 The c- 

.•j er&B-2 was a human gene discovered by its homology to the retroviral gene v- 

eriB. 33 ' 49 ' 78 HER-2 was isolated by screening a human genomic DNA library for 
homology with v-eriB. 24 When the DNA sequences were determined subse- 
quently, c-eriB-2, HER-2> and neu were found to represent the same gene. 
Recently, the c-erfcB-2 oncogene also has been referred to as NGL. 

I The c-erfeB-2 DNA is located on human chromosome 17q21 24JM6 and codes 

for c-erbB-2 mENA (4.6 kb), which translates c-erfcB-2 protein (pl85). This 
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DECLARATION OF PAUL POLAKIS, PhD. 
I, Paul Polakis, Ph,D., declare and say as follows: , 

1. I was awarded a Ph.D. by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and fonns 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist Since joining Genentech in 1999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3 As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins", When such a tumor antigen protein is 
identified, one can produce an antibody mat recognizes and binds to that protein 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4^ In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells 
at genomic DNA, mRNA and protein levels. An important example of one such ' 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another In the 
course of our research using microarray analysis, We have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells To date we 
have generated antibodies that bind to about 30 of the tumor antigen proteins ' ' 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells We 
have then compared the levels of mRNA and protein in both the tumor and normal 
ceils 3n&lyzed. 

5 From the mRNA and protein expression analyses described in paragraph 4 
above we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



I 



expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including ;the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature ,it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. . While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
arid further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon. 
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Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
Non-invasive and Invasive Human Transitional 
Cell Carcinomas* 

Torben F. 0riitoft|:§, Thomas ThykjaeiH, Frederic M. Waldman||, Hans Wolf** 
and Julio E. Celisfl: 



Gain and loss of chromosomal materia! is characteristic 
of bladder cancer, as well as malignant transformation in 
general. The consequences of these changes at both the 
transcription and translation levels is at present unknown 
partly because of technical limitations. Here we have at- 
tempted to address this question in pairs of non-Invasive 
and invasive human bladder tumors using a combination 
of technology that included comparative genomic hybrid* 
ization, high density oligonucleotide array-based monitor- 
ing of transcript levels (5600 genes}, and high resolution 



phenomenon at both the transcription ahd translation levels. 
High throughput array studies of the breast cancer cell line 
BT474 has suggested, that there is a correlation between 
DNA copy numbers and gene expression In highly amplified 
areas (2), and studies of Individual genes in solid tumors 
have revealed a good correlation between gene dose and 
mRNA or protein levels In the case of c-err>B2, cycltn dj, 
em$1, and N-myc (3-5), However, a high cyclin D1 protein 
expression has been observed without simultaneous am- 



two-dimensfonal gel electrophoresis/the results showed A^P ,iflcatlon ( 4 )> and * l ow ,e vel of c-myc copy number In- 



that there Is a gene dosage effect mat in some cases 
.superimposes on other regulatory mechanisms. This ef- 
fect depended (p < 0.015) on the magnitude of the com- 
parative genomic hybridization change. In general (18 of 
23 cases), chromosomal areas with more than 2-fold gain 
of DNA .showed a corresponding increase in mRNA tran- 
scripts. Areas with loss of DNA, on the other hand, 
showed either reduced or unaltered transcript levels} Be- 
cause most proteins resolved by. two-dimensional gels 
are unknown it was only possible to compare mRNA and 
protein alterations in relatively few cases of well focused 
abundant proteins. QlVith few exceptions we found a good 
correlation (p < 0.005) between transcript alterations and 
protein levels. The implications, as well as limitations, 
of the approach are discussed. Molecular & Cellular 
Proteomlcs 1:37-45, 2002. 



Aneuploidy Is a common feature of most human cancers 
(1), but little Is known about the genome-wide effect of this 
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crease was observed without concomitant c-myc protein 
overexpression (6). 

In human bladder tumors, karyotyping, fluorescent in situ 
hybridization, arid comparative genomic hybridization (CGH) 1 
have revealed chromosomal aberrations that seem to be 
characteristic of certain stages of disease progression jn the 
case of non-Invasive pTa transitional ceil carcinomas (TCCs), 
this Includes loss of chromosome 9 or.parts of it, as well as 
Iqss of Y in males. In minimally Invasive pT1 TCCs, the fol- 
lowing alterations have been reported: 2q-, 11 p~, lq+, 
11q13+, 17q+, and 20q+ (7-12). It has been suggested that 
these regions harbor tumor suppressor genes and onco- 
genes; however, the large chromosomal areas Involved often 
contain many genes, making meaningful predictions of the 
functional consequences of losses and gains very difficult. 

In this investigation we have combined genome-wide tech- 
nology for detecting genomic gains and losses (CGH) vvfth 
gene expression prof ling techniques (mlcroarrays and pro- 
teomlcs) to determine the effect of gene copy number on 
transcript and protein levels in pairs of non-Invasive and In- 
vasive human bladder TCCs. 

EXPERIMENTAL PROCEDURES 

Materia!— Bladder tumor biopsies were sampled after Informed 
consent was obtained and after removal of tissue for routine pathol- 
ogy examination. By light microscopy tumors 335 and 532 were 
staged by an experienced pathologist as pTa (superficial papillary), 

1 The abbreviations used are: CGH, comparative genomic hybrid- 
ization; TCC, transitional cell carcinoma; LOH, loss of heterozygosity; 
PA-FABP, psoriasis-associated fatty acld-bfndfng protein; 2D f 
two-dimensional. • 
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Fig. .1 . DNA copy number and mRNA expression (evel. Shown from left to right are chromosome (Chr.), CGH profiles, gene location and 
expression level of specific genes, and overall expression level along the chromosome. A, expression of mRNA In Invasive tumor 733 as 
compared with the non-Invasive counterpart- tumor 335. a, expression of mRNA In Invasive tumor 827 Compared with the non-Invasive 
counterpart tumor 532. The average fluorescent signal ratio between tumor DNA and normal DNA Is shown along the length of the chromosome 
(ten). The bold curve in the ratio profile represents a mean of four chromosomes and is surrounded by thin curves Indicating one standard 
deviation. The centra/ vertical line (broken) Indicates a ratio value of 1 (no change), and. the vertical lines next to it (dotted) indicate a ratio of 
0.5 (ler» and 2.0 {fight). In chromosomes where the noninvasive tumor 335 used for comparison showed alterations in DNA content, the ratio 
profile of that chromosome Is shown to the right of the Invasive tumor profile. The colored bars represents one gene each. Identified by the 
running numbers above the oars (the name of the gene can be seen at www.MDLDK/sdata.html). The bars Indicate the purported location of 
the gene, and the colors indicate the expression level of the gene In the invasive tumor compared with the non-invasive counterpart; >2-fold 
Increase (b/acA), >2-fold decrease (blue), no significant change {orange). The bar to the far right, entitled Expression shows the resulting change 
In. expression along the chromosome; the coiors indicate that at least half of the genes were up-regulated (b/acA), at least half of the genes 
down-regulated (b/ue), or more than half of the genes are unchanged (orange), if a gene was absent in one of the samples and present In 
another. It was regarded as more than a 2-fold change. A 2-fold level was chosen as this corresponded to one standard deviation in a double 
determination of ~1800 genes. Centromeres and heterochromatic regions were excluded from data analysis. 



grade I and II, respectively, tumors 733 and 827 were staged as pT1 
(invasive into submucosal 733 was staged as solid, and 827 was 
staged as papillary, both grade 111. 

mRNA Preparation— l\ssue biopsies, obtained fresh from surgery, 
were embedded immediately In a sodlum-guanldlnlum thiocyanate 
solution and stored at -80 4 C. Total RNA was isolated using the 
RNAzol B RNA Isolation method (WAK-Chemie Medical GMBH). 
polyW* RNA was Isolated by an. ol!go{rfT) selection step (Oligotex 
mRNA Wt; Qlagen). 

cRNA Preparation— 1 /xg of mRNA was used as starting material. 
The first and second strand cONA synthesis was performed using the 
Superscript® choice system (Invitrogen) according to the manufac- 
turer's Instructions but using an o(igo(dT) primer containing aT7 RNA 
polymerase binding site. Labeled cRNA was prepared using the ME* 
GAscrip® In vitro transcription kit (Amblon). Biotln-labeled CTP and 



UTP (Enzo) was used, together with unlabeled NTPs In the reaction. 
Following the In vitro transcription reaction, the. unincorporated nu- 
cleotides were removed using RNeasy columns (Qiagen). 

Array Hybridization and Scanning— Amy hybridization and scan- 
ning was modified from a previous method (13). 10 /xg of cRNA was 
fragmented at 94 °C for 35 min in buffer containing 40 mM Tris 
acetate, pH 8,1. 100 mM KOAc, 30 mM MgOAc. Prior to hybridization, 
the fragmented cRNA in a 6x SSPErY hybridization buffer (1 m NaCI, 
10 mM Tris, pH 7.6, 0.005% Triton), was heated to 95 °C for 5 min, 
subsequently cooled to 40 °C, and loaded onto the Aftymetrix probe 
array cartridge. The probe array was then incubated tor 18 h at 40 "C 
at constant rotation (60 rpm). The probe array was exposed to 10 
washes In 6x SSPE-T at 25 °C followed by 4 washes In 0.5X SSPE-T 
at 5u fl C. The biotinylated cRNA was stained with a streptavidin- 
phycoerythrin conjugate, 10 jxg/rhl (Molecular Probes) In 6x SSPE-T 
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Fig. A—continued 



for 30 mln at 25 °C followed by 1 0 washes In 6 X SSPE-T at 25 °C, The 
probe arrays were scanned at 560 nm using a confocal laser scanning 
microscope (made for Asymetrix by Hewlett-Packard!). The readings 
from the quantitative scanning were analyzed by Asymetrix gene 
expression analysis software. 

Mlcrosatellite Analysis— Microsateilite Analysts was performed as 
described previously (14). Microsatellites were selected by use of 
wwwjwblJilmjiih.gov/genemap98, and primer sequences were ob- 
tained from the genome database at www.gdb.org. DNA was extracted 
from tumor and Wood and amplified fcy PCR In a volume of 20 /J for 35 
cycles. The ampllcons were denatured and electrophoresed for 3 h In an 
ABI Prism 377. Data were coBected In the Gene Scan program for 
fragment analysis. Loss of heterozygosity was defined as less man 33% 
of one allele detected In tumor ampllcons compared with blood. 

Proteomfc Analysis— TCCs were minced Into small pieces and 
homogenized In a small glass homogehlzer In 0.5 ml of lysis solution. 
Samples were stored at -20 °C until use. The procedure for 2D gel 
electrophoresis has been described In detail elsewhere (15, 16). Gels 
were stained with silver nitrate and/or Coomassie Brilliant Blue. Pro- 
teins were Identified by a combination of procedures that included 
microsequehcing, mass spectrometry, two-dimensional gel Western. 
Immunobtottlng, and comparison with the master two-dimensional gel 
image of human Keratinocyte proteins; see brobase.dK/cgl-bIn/celis. 

CGH— Hybridization of differentially labeled tumor and normal DNA 
to normal metaphase chromosomes was performed as described 
previously (10). Fluorescein-labeled tumor DNA (200 ng), Texas Red- 



labeled reference DNA (200 ng), and human Cot-1 DNA (20 /xg) were 
denatured at 37 °C for 5 min and applied to denatured normal met- 
aphase slides. Hybridization was at.37 °C for 2 days. After washing, 
the slides were counterstalned with 0.16 jtg/ml 4 r 6-<fiamWino-2-phe- 
nylindole in an anti-fade soiutioru A second hybridization was per- 
formed for all tumor samples using fluorescein-labeled reference DNA 
and Texas Red-labeled tumor DNA (inverse labeling) to confirm the 
aberrations detected during the Initial' hybridization. Each CGH ex- 
periment also Included a normal control hybridization using fluores- 
cein- and Texas Red-labeled normal DNA. Digital Image analysis was 
used to identify chromosomal regions with abnormal fluorescence 
ratios, indicating regions of DNA gains and losses. The average 
greeri:red fluorescence intensity ratio profiles were calculated using 
four images of each chromosome (eight chromosomes total) with 
normalization of the greemred fluorescence Intensity ratio for the 
entire metaphase and background correction. Chromosome Identifi- 
cation was performed based on 4,6-diamidlno-2-phenyiindoie band- 
ing patterns. Only images showing uniform high Intensity fluores- 
cence with minimal background staining were analyzed. All 
centromeres, p arms of acrocentric chromosomes, and heterochro- 
matic regions were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridlzatlon-The CGH analysis 
identified a number of chromosomal gains and. losses In the 
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Table I 

. Correlation between alterations detected by CQH and by expression monitoring 

Top. CGH used as Independent variable (If CGH alteration - what expression ratio was found); tybttom, altered expression used , as 
independent variable (If expression alteration — what CGH deviation was found). ■ * 1 . 1 



CGH alterations 



Tumor 733 vs. 335 
Expression change clusters 



Concordance CGH alterations 



Tumor 827 vs. 532 
Expression change clusters 



Concordance 



13 Gain 
10 Loss 



10 Up-regulatlon 

0 Down-regulation 

3 No change 

1 Up-regutation 

5 Down-regulation 

4 No change 



77% 



50% 



10 Gain 



12 Loss 



8 Up-regulation 
0 Down-regulation 

2 No change 

3 Up-regulatlon 

2 Down regulation 
7 No change 



80% 



17% 



Expression change clusters 



Tumor 733 vs. 335 
CGH alterations 



Concordance Expression change clusters 



Tumor 827 vs. 532 
CGH alterations 



Concordance 



16 Up-regulation 
21 Down-regulation 
15 No change 



11 Gain 

2 Loss 

3 No change 
1 Gain 

8 Loss 

12 No change 
3 Gain 

3 Loss 

9 No change 



69% 
38% 
60% 



17 Up-regulation 

9 Down-regulation 

21 No change 
I 



10 Gain 

5 Loss 

2 No change 

0 Gain 

3 Loss 

6 No change 

1 Gain 
3 Loss 

17.No change 



59% 
33% 
81% 



two Invasive tumors (stage pT1 , TCCs 733 and 827), whereas 
the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p-, 9q22-q33-, and X-, and 7+, 9q-, 
and respectively. Both Invasive tumors showed changes 
(1q22-24-H/ 2q14.1jqter-, 3q12-q13.3~, 6q12-q22~, 
9q34+, 11qi2^q13+, 17+, and 20q1 1 .2-q1 2+) that are typ- 
ical for their disease stage, as well as additional alterations, 
some of which are. shown In Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4-fold 
in the case of TCC 733 and 0.3-fold for TCC 827. The largest 
changes, amounting to at least a doubling of chromosomal 
content, were* observed at 1q23 In TCC 733 (Fig. 14) and 
20q12 In TCC 827 (Fig. 1B). 

mRNA Expression In Relation to DNA Copy Number— The 
mRNA levels from the two invasive tumors (TCCs 827 and 
.733) were compared with the two non-invasive counterparts 
(TCCs 532 and 335). This was done In two separate experi- 
ments In which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1,800 genes that yielded a signal on the arrays 
were searched In the Unlgerie and Genemap data bases, for 
chromosomal location, and those with a known location 
(1096) were plotted as bars covering their purported locus. In 
that way It was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level in 
the Invasive versus the non-lnvaslve counterpart. Bars, which 
represent chromosomal location of a gene, Were color-coded 
according to the expression ratio, and only differences larger 



than 2-fold were regarded as informative (Rg. 1). The density 
of genes along the chromosomes varied, and areas contain* 
Ing only one gene were excluded from the calculations. The 
resolution of the QGH method is very low, and some of the 
outlier data may be because of the factthat the boundaries of 
the chromosomal aberrations are not known at high resolution. 

Two sets of calculations were made from the data- For the 
first set we used CGH alterations as the Independent variable 
and estimated the frequency of expression alterations in these 
chromosomal areas. In general, areas with a strong gain of 
chromosomal material contained a. cluster of genes having 
Increased mRNA expression. For example, both chromo- 
somes 1q21-q25, 2p and 9q, showed a relative gain of more 
than 100% In DNA copy number that was accompanied by 
increased mRNA expression levels In the two tumor pairs (Rg. 
1). In most cases, chromosomal gains detected by CGH were 
accompanied , by an increased level of transcripts In both 
TCCs 733 (77%) and 827 (80%) (Table I. top). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression In several cases, and were often* regis- 
tered as having unaltered RNA levels (Table I, top). The Inabil- 
ity to detect RNA expression changes In these cases was not 
because of fewer genes mapping to the lost regions (data not 
. shown). 

In the second set of calculations we selected expression 
alterations above 2-fold as the Independent variable and es- 
timated the frequency of CGH alterations in these areas. As 
above, we found that increased transcript expression corre- 
lated with gain of chromosomal material (TCC 733, 69% and 
TCC 627, 59%), whereas reduced expression was often de- 
tected in areas with unaltered CGH ratios (table I, bottom). 
Furthermore, as a control we looked at areas with no alter- 
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A1A4 
A 



A A 
AAA 

A 



A • 



' A 
' AAA 



AAA 
AA 
A 



Expression changes 



Expression changes 
not detected 



Expression changes 
detected 



Expression cheng es 
not detected 



Tumor 827 versus 532 Tumor 733 versus 335 

Fig. 2. Correlation between maximum CQH aberration and the ability to detect expression change by oligonucleotide array 
monitoring. The aberration Is shown as a numerical -fold change In ratio between Invasive tumors 827 (A) and 733 (♦) and their hon-lnvaslve 
counterparts 532 and 335. The expression change was taken from the Expression line to the right In Fig. 1, which depicts the resulting 
expression change for a given chromosomal region. At least half of the mRNAs from a given region have to be elther.up- or down-regulated 
to be scored as an expression change. Ail chromosomal arms in which the CGH ratio plus or minus one standard deviation was outside the 
ratio value of cm© were Included: 



atlon in expression. No alteration was detected by CGH in 
most of these areas (TCC 733, 60% and TCC 827, 81%; see 
Table I, bottom). Because the ability to observe reduced or 
increased mRNA expression clustering to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations In the 
regions showing CGH changes against the ability to detect a 
change in mRNA expression as monitored by the oligonucleo- 
tide arrays (Fig. 2)(fpr both tumors TCC 733 {p< 0.015) and 
TCC 827 (p < 0.00003). a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
the DNA copy number) and alterations detected by the array 
based technology (Fig. 2^ Similar data were obtained when 
areas with altered expression were used as Independent vari- 
ables. These areas correlated best with QGH when the CGH 
ratio deviated 1 .6- to 2.0-fold (Table I, bottom) but mostly did 
hot at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction In expres- 
sion level, which Is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
much larger extent 

MicrosatQlllte-based Detection of Minor Areas of Loss- 
es— In TCC 733, several chromosomal areas exhibiting DNA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Rg. 1, TCC 733 
chromosome 1q32, 2p21, and 7q21 and q32, 9q34, and 
10q22). To determine whether these results were because of 
undetected loss of chromosomal material In these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two microsatellltes positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Rg. 3). Additionally, chromosome 2p in 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript increase/decrease/in- 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Rg. 1/4). As Indicated above, the mRN A decrease observed In 
the middle of the chromosomal gain was because of LOH, 
Implying that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material. However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827; the telomerlo end of chro- 
mosome 11p showed a normal ratio in the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two microsatellltes (D11S1760, D11S922) 
positioned close to MUC2, iGF2,.and cathepsln D Indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression of mRNA observed In TCC 733 at 
chromosomes 3q24, 11p11, 12p12.2, 12q21.1, and 16q24 
and in TCC 827 at chromosome 11p15.5; 12p11, 15q11.2, 
and 18q12 was also examined for chromosomal losses using 
microsatellltes positioned as close as possible to the gene loci 
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Fig: 3. Mlcrosatellite analysis of loss of heterozygosity. Tumor 
733 showing loss of heterozygosity at chromosome 1q25, detected 
(a) by D1S215 close to Hu class I histocompatibility antigen (gene 
number 38 in Rg: 1), (6) by D1S2735 close to cathepsln E (gene 
number 41 In Fig. 1), arid (c) at chromosome 2p23 by D2S2251 close 
to general ^-spectrin (gene number 1 1 on Fig. 1) and of (o) tumor 827 
showing loss of heterozygosity at chromosome 18q12 by S18S11 18 
close to mitochondrial 3-oxoacyteoenzyme A thiblase (gene number 
12 In Fig. U The upper curves show the electropherogram obtained 
from normal DNA from leukocytes (A/), and the tower curves show the 
electropherogram from tumor DNA (7): In all cases one allele Is 
. partially lost In the tumor ampHcon. . 

showing reduced mRNA transcripts. Only the rhlcrosatelilte 
positioned at 18q12 showed LOH (Fig. 3), suggesting that 
transcriptional down-regulation of genes In the other regions 
may be controlled by other mechanisms. 

Relation between Changes In mRNA and Protein Levels— 
2D-PAGE analysis, In combination with Coomassle Brilliant 
Blue and/or silver staining, was carried out on all four tumors 
using fresh" biopsy material. 40 well resolved abundant known 
proteins migrating In areas away from the edges of the pH 
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A B 
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A. 
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Reduced 
prikpin' 



Unaltered 
protein 



Increased 
protein. 



Fig. 4. Correlation between protein levels as judged by 20- 
PAGE and transcript ratio. For comparison proteins were divided In 
three groups, unaltered In level or up- or down-regulated horizontal 
axis). The mRNA ratio as determined by oligonucleotide arrays was 
plotted for each gene {vertical axis). A, mRNAs that were scored as 
present in. both tumors used for the ratio calculation; A, mRNAs that 
were scored as absent in the Invasive tumors (along horizontal axis) or 
as absent in noninvasive reference {top of figure). Two different 
. scalings were used to. exclude scaling as a compounder, TCCs 827 
and 532 (AA) were scaled with background suppression, and TCCs 
733 and 335 (eC) were scaled without suppression. Both compari- 
sons showed highly significant (p < 0.005) differences In mRNA ratios 
between the groups'. Proteins shown were as follows: Group* prom 
left), phosphoglucomutase 1, glutathione transferase class number 
4, fatty acid-binding protein homologue, cytokeratln 15, and cyto- 
keratin 13; 8 (from /eft), fatty acid-binding protein homologue, 28-kDa 
heat shock protein, cytokeratln 13, and calcyclln; C(from /eft), a-eno- 
lase, hnRNP B1, 28-kDa heat shock protein. 14-3-3-e^ and 
pre-mRNA splicing factor; D, mesothelial keratin K7 (type IQ; £(from 
top), glutathione S-transferase-ir arid mesothelial keratin K7 (type II); 
^(from top and left), adenylyl cyclase-assoclated protein, E-cadherin, 
keratin 19, calglzzarin, phosphoglycerate mutase, annexln IV, cy- 
toskeJetal y-actln, hnRNP A1, Integral membrane protein catnexfn 
(IPSO), hnRNP H, brain-type clathrin light chain-a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear ribonucleoprotein A/B, 
translations^ controlled tumor protein, liver gIyceraldehyde-3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na,K- 
ATPase ^1 subunit; G, (from fop and left), TCP20, calglzzarin,. 70- 
kDa heat shock protein, calnexin, hnRNP H f cytokeratln 15, ATP 
synthase, keratin 19, triosephosphate Isomerase, hnRNP F, liver glyo- 
eraIdehyde-3-phosphatase dehydrogenase, glutathione Srtransfer- 
ase-w, and keratin 8; H (from left), plasma geisolin, autoantlgen cai- 
reticulin, thioredoxln, and NAD+-dependent 15 hydroxyprbstaglandin 
dehydrogenase; / (from top), prolyl 4-hydroxylase 0-subunit, cyto- 
keratln 20, cytokeratln* 17, prohibition, and fructose 1,6-biphos- 
phatase; J annexfn II; K, annexin IV; L (from top and left), 90-kDa heat 
shock protein, prolyl 4-hydroxylase 0-subunit, er-enolase, GRP 78. 
cyclophllin, and cofilin. 

gradient, and having a known chromosomal location, were 
selected for analysis In the TCC pair 827/532. Proteins were 
Identified by a cdmblnation of methods (see "Experimental 
Procedures"). In general there was a highly significant corre- 
lation (p < 0.005) between mRNA and protein alterations (Fig. 
.4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except for a group of cyto- 
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keratins encoded by genes on chromosome 17 (Fig; 5] the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a Know chromosomal 
location were detected in TCCs 733 and 335, and of these 19 
correlated (p < 0.005) with the rriRNA changes detected using 
the arrays (Fig, 4). For example, PA-FABP was highly ex- 
pressed In the non-Invasive TCC 335 but lost in the Invasive 
counterpart (TCC 733; see Fig. 5). The smaller number of 
proteins detected In both 733 and 335 was because of the 
smaller size of the biopsies. that were available. 

11 chromosomal regions where CGH showed aberrations 
that corresponded to the changes In transcript levels also 
showed corresponding changes In the protelh level (Table tl). 
These regions included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratins 17 and 20, annexins II and IV, and the fatty 
acld-binding proteins PArFABRand FBP1. Four of these pro- 
teins were encoded by genes in chromosome 17q, a fre- 
quently amplified chromosomal area' In Invasive bladder 
cancers. 

.'. DISCUSSION 
Most human cancers have abnormal DNA content, having 
lost some chromosomal parts and gained others. The present 
study provides some evidence as tq the effect of these gains 
and losses on gene expression In two pairs of non-Invasive 
and invasive TCCs using high throughput expression arrays 
and proteomlcs, in combination with CGH. In general, the 
results showed that there is a clear Individual regulation of the 
mRNA expression of single genes, which In some cases was 
superimposed by a DNA copy number effect In most cases, 
genes located In chromosomal areas with, gains often exhib- 
ited increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the fact that losses, most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fold change, 
thus being at the border of detection. In several cases, how- 

Tabus II 



Proteins whose expression level correlates with both mRNA and gene dose changes 


Protein 


Chromosomal location 


Tumor TCC. 


CGH alteration 


Transcript alteration 9 


Protein alteration 


Annexln 11 


1q21 


733 


Gain 


AbstoPres* 


Increase 


Annexln IV 


2p13 


733 


' Gain 


3.9-Fold up 


increase 


Cytokeratin 17 


17q12-q21 


327 


Gain 


3.8-Fold up 


Increase 


Cytokeratin 20 


17q21.1 


827 


Gain 


5.6-Fold up 


Increase 


(PA-)FABP 


8q21.2 


. 827 


Loss 


10-Fo!d down 


Decrease 


. FBP1 


9q22 


.827 


Gain 


2.3-Fold up 


Increase 


Plasma gelsolln 


9q31 


827 


Gain 


Abs to Pres 


Increase 


Heat shock protein 28 


15q12-q13 


827 


Loss 


2.5-Fold up 


Decrease 


Prohibits 


17q21 


827/733 


Gain 


3.7-/2.5-F6ld up 6 


Increase 


Prolyl-4-hydroxyt 


17q25 


827/733 


Gain 


5.7V1.6-Fold up 


Increase 


hnRNPBI 


7p15 


827 


Loss 


2.5-Fold down 


Decrease 



• Aba, absent; Pres. present. 

b In cases where the corresponding alterations were found in both TCCs 827 and 733 these are shown as 827/733. 
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Fig. 5. Comparison of protein and transcript levels in Invasive 
and non-invasive TCCs. The upper part of the figure shows a 2D gel 
{left) and the oligonucleotide array {right) of TCC 532. The red rectan- 
gles on the upper get highlight the areas that are compared below. 
Identical areas of 2D gets of TCCs 532 and 827 are shown below. 
Clearly, cytokeratins 13 and 15 are strongly down-regulated In TCC 
827 (red annotation). The tile on the array containing probes for 
cytokeratin 15 Is enlarged below the array (red arrow) from TCC 532 
and is compared with TCC 827. The upper row oi squares In each tile 
corresponds to perfect match probes; the lower row corresponds to 
mismatch probes containing a mutation (used for correction for un- 
speciftc binding). Absence of signal Is depicted as black, and the 
higher the signal the lighter the color. A high transcript level was 
detected In TCC 532 (6151 units) whereas a much lower level was 
detected in TCC 827 (absence of signals); For cytokeratin 13, a high 
transcript level was also present in TCC 532 (15659 units), and a 
much lower level was present In TCC 827 (623 units). The 2D gels at 
the bottom of the figure pen) show levels of PA-FABP and adlpocyte- 
FABP In TCCs 335 and 733 (Invasive), respectively. Both proteins are 
down-regulated in the invasive tumor. To the right we show the array 
tiles for the PA-FABP. transcript A medium transcript level was de- . 
tected in the case of TCC 335 (1277 units) whereas very low levels 
were detected In TCC 733 (166 units). /EF, isoelectric focusing. 
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ever, an Increase or decrease In DMA copy number was 
associated with de novo occurrence or complete loss of tran- 
script, respective!/. Some of these transcripts could not be 
detected In the non-Invasive tumor but were present at rela- 
tively high levels In areas with DNA amplifications In the Inva- 
sive tumors (&£. in TCC 733 transcript from cellular ligand of 
annexin II gene (chromosome 1q21) from absent to 2670 
arbitrary units; In TCC 827 transcript from small proline-rich 
protein 1 gene (chromosome 1q12-g21,1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an Increased expres- 
sion to a certain chromosomal area indicates an increased 
likelihood of gain of chromosomal material In this area. 
' Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may lie in the los§ of controlled 
methylation in tumor cells (17-19). Thus, it may be possible 
that In chromosomes with increased DNA copy numbers two 
or more alleles could be demethylated simultaneously leading 
to a higher transcription level, whereas In chromosomes with 
losses the remaining allele could be partly, methylated, turning 
-off the process (20, 21). A recent report has documented a 
ploidy regulation of gene expression in yeast, but In.this case ail 
the genes were present in the same ratio (22), a situation that is 
hot analogous to that of cancer ceils, which show marked 
chromosomal aberrations, as well as gene dosage effects. 

Several CGH studies of bladder cancer have shown that 
some chromosomal aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3 tumors. In pTa tumors, these include 9p-, 9q-, 1q+, Y- 
(2, 6), and In pTI tumors, 2q-,11p-, 11q-, 1q+, 5p+, 8q+ t 
17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p— and 9q22-q33- and 
9q- and Y-, respectively. Likewise, the two minimal invasive 
pTI tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1 q2£-24 
amplification (seen In both tumors), 1 1 q14-q22 loss, the latter 
often linked to 17 q+ (both tumors), and 1q+ and 9p-, often 
linked to 20q+ and 11 q13+ (both tumors) (7-9). These ob- 
servations indicate that the pairs of tumors used, in this study 
exhibit chromosomal changes observed in many tumors, and 
. therefore the findings could be of general importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases it is only possible to get a crude picture of 
chromosomal instability using this technique. Occasionally, 
we observed reduced transcript levels close to or Inside re- 
gions with increased copy numbers. Analysis of these regions 
by positioning heterozygous microsatellltes as close as pos- 
sible to the locus showing reduced gene expression revealed 
loss of heterozygosity In several cases. It seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use of cDNA microarrays for analysis of DNA 
copy number chahges will reach a resolution that can resolve 
these changes, as has recently been proposed (2). The outlier 
data were not more frequent 1 at the boundaries of the CGH 
. aberrations. At present we dd.not know the mechanism be- 
hind chromosomal aneuploidy and cannot predict whether 
chromosomal gains will be transcribed to a larger extent than 
the two native alleles. A mechanism as genetic Imprinting has 
. an impact on the expression level in normal cells and Is often 
reduced in tumors. However, the relation between imprinting 
and gain of chromosomal material is not known. 

We regard It as a strength of this Investigation that we were 
able to compare jnvasive tumors to benign tumors rather than 
to normal urothelium, as the tumors studied were biologically 
very close. and probably may represent successive steps in 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available it was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and rnRNA expression Is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This indicate that 
bladder tumors are relatively homogenous, a notion recently 
supported by CGH and LOH data that showed a remarkable 
similarity even between tumors and distant metastasis (10, 23). 

In the few cases analyzed, rnRNA and protein levels 
showed a striking correspondence although in some cases 
we found discrepancies that may be attributed to transtatlonaJ 
regulation, post-translationai processing; protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertranslated rnRNA pools, which are associated with few 
translationaily Inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very Important in the. case of polypeptides with a short 
half-life (e.g. signaling proteins). A poor correlation between 
rnRNA and protein levels was found in liver ceils as .deter- 
mined by arrays and.2D-PAGE (26), and a moderate correla- 
tion was recently reported by Ideker ef a/. (26) in yeast. 
Qnterestingly, our study revealed a much better correlation 
between gained chromosomal areas and increased. rnRNA 
levels than, between loss of chromosomal areas and. reduced 
rnRNA levels. In general, the level of CGH change determined 
the ability to detect a change In transcript) One possible 
explanation could be that by losing one allele the change in 
rnRNA level Is not so dramatic as compared with gain of 
material, which can be rather unlimited and may lead to a 
seyeraifold increase In gene copy number resulting in a much 
higher Impact on transcript level. The latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2-fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may in the future allow detection of lesser than 2-fold 
alterations in transcript levels, a feature that may facilitate the 
analysis of the effect of loss of chromosomal areas on tran- 
script levels. 
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In eleven cases we found a significant correlation between 
DNA.copy number, mRfsJA expression, and protein level. Four 
of these proteins were encoded by genes located at a fre- 
quently-amplified area In chromosome I7q. Whether DNA 
copy number Is one of the mechanisms behind alteration of 
these eleven proteins Is iat present unknown and will have to 
be proved by other methods using a larger number of sam- 
ples. One factor making such studies complicated Is the large 
extent of protein modification that occurs after translation, 
requiring immunoidentification and/or mass spectrometry to 
correctly Identify the proteins In the gels. 

In conclusion, the results presented In this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on migroairays with many thousand radiation 
hybrid-mapped genes will increase the resolution and fnforma- 
. tlon derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived from genes with 
known locations, and 2D gel analysis to obtain. Information at 
the post-translational level, a clearer and more developed un- 
derstanding of the tumor genome will, be forthcoming. 
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ABSTRACT 

Genetic changes underlie tumor progression and may lead to cancer- 
specific, expression or critical genes. Over 1100 publications have de- 
scribed the use of comparative genomic hybridization (CGH) to analyze 
the pattern of copy number alterations in cancer, but very few of the genes 
affected are known. Here, we performed high-resolution CGH analysis on 
cDNA microarrayt in breast cancer and directly compared copy number 
and roRNA expression levels of 13,824 genes to quautitate the impact of 
genomic changes on gene expression. We identified and mapped the 
boundaries of 24 independent ampticoas, ranging in size from 0 J to 12 
Mb. Throughout the genome, both high- and low-level copy number 
changes had a substantia] impact on gene expression, with 44% of the 
highly amplified genes showing overexpression and 10.5% of the highly 
overexpressed genes being amplified. Statistical analysis with random 
permutation tests identified 270 genes whose expression levels across 14 . 
samples were systematically attributable to gene amplification.- These 
included most previously described amplified genes in breast cancer and 
many novel targets for genomic alterations, including the HOXB7 gene, 
the presence of which in a novel ampllcon at 17q213 was validated in 
'10.2% of primary breast cancers and associated with poor patient prog- 
nosis. In conclusion, CGH on. cDNA mlcroarrays revealed hundreds of 
novel genes whose overexpression Is attributable to gene amplification* 
These genes may provide insights to the clonal evolution and progression 
of breast cancer and highlight promising therapeutic targets. 

INTRODUCTION 

Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biologically distinct catego- 
ries, some of which may explain the clinical behavior of the tumors 
• (1-6), Despite this progress in diagnostic classification, the molecular 
mechanisms underlying gene expression patterns in cancer have re- 
mained elusive, and the utility of gene expression profiling in the 
identification of specific therapeutic targets remains limited 

Accumulation of genetic defects is thought to underlie the clonal 
evolution of cancer. Identification of the genes that mediate the effects 
of genetic changes may be important by highlighting transcripts that 
are actively involved in tumor progression. Such transcripts and their 
encoded proteins would be ideal targets for anticancer therapies, as 
demonstrated by the clinical success of new therapies against ampli- 
fied oncogenes, such as ERBB2 and EGFR (7, 8), in breast cancer and 
other solid tumors. Besides amplifications of known oncogenes, over 




Expression ratio 



Fig. I . Impact of gene copy number on global gene expression levels. A. percentage of 
over- and uadcrcxpresscd genes (y axis) according to copy number ratios (X axis). 
Threshold values used for over- and tmdcicxpressibn were >2.184 (global upper 7% of 
the cDNA ratios) and <0.4826 (global lower 7% of the expression ratios). & percentage 
of amplified and deleted genes according to expression ratios. Threshold values for 
amplification and deletion were >1.5 and <0.7. 



20 recurrent regions of DNA amplification have been mapped in 
breast cancer by CGH 5 (9, 10). However, these amplicons are often 
large and poorly defined, and their impact on gene expression remains 
unknown. 

We hypothesized that genome-wide identification of those gene 
expression changes that are attributable to underlying gene copy 
"s^to^*^^ number alterations would Wghlight tnuiscripts that are actively in- 

ncer Society, the Hrkanmaa Cancer Society, the Pirkanmaa Cultural Foundation, the v Olved m the causation or maintenance of the malignant phenotype. 
tnish Breast Cancer Group, the Foundation for the Dcvcloomcnt of Laboratory Med- To identify such transcripts, we applied a Combination Of cDNA and 
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CGH microarrays to: (a) determine the global impact that gene copy 
number variation plays in breast cancer development and progression; 
and (b) identify and characterize those genes wnose iriRNA expres- 
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3 The abbreviations used are: CGH. comparative genomic hybridization; FISH, fluo- 
rescence In situ hybridisation; RT-PCR, reverse transcription-PCR. 



GENE EXPRESSION PATTERNS IN BREAST CANCER 
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Fig. 2. Gcnom^widc copy number and expression analysis fn the MCF-7 breast cancer cell line. A, chromosomal. CGH analysis of MCF-7! The copy number ratio profile (Wife 
line) across the entire genome, from Ip telomere to Xq telomere is shown along with ± I SD. (orange lines). The black horizontal line indicates a ratio of 1.0: red line, a ratio of 0.8* 
■SfELj? V 19X10 0fl2 -*< genome-wide copy number analysis to MCF-7. by CGH on cDNA microarray. The copy number ratios were plotted as a function of the position 




(undepressed genes); the rest of the observations are shown with black crosses. The chromosome numbers are shown at the bottom of the figure.- and chromosome boundaries are 
rnriir*i«4 with a dashed line, . 



sion is most significantly associated with amplification of the corre- 
sponding genomic template. 

MATERIALS AND METHODS 

Breast Cancer Ceil Lines. Fourteen breast cancer cell lines (BT-20, BT- 
474, HCC1428, Hs578t, MCF7, MDA-361, MDA-436, MDA-453, MDA-468, 
SKBR-3, T-47D, UACC812, ZR-75-1, and ZR-75-30) were obtained from the 
American Type Culture Collection (Manassas, VA). Cells were grown under 
recommended culture conditions. Genomic DNA and mRNA were isolated 
using standard protocols. 

Copy Number and Expression Analyses by cDNA Microarrays. The 
preparation and printing of the 13,824 cDNA clones on glass slides were 
performed as described (J 1-13). Of these clones, 244 represented uncharac- 
terized expressed sequence tags, and the remainder corresponded to known 
genes. CGH experiments on cDNA microarrays were done as described (14, 
.15). Briefly, 20 jig of genomic DNA from breast cancer cell lines and normal 
human WBCs were digested for 14-18 h with AM and Aral (Life Technol- 
ogies, In<x, Roclcyille, MD) and purified by phenolAtoloroform extraction. Six 
pg of digested cell line DNAs were labeled with Cy3-dUTP (Amersham 
Pharmacia) and normal DNA with CyS-dUTP (Amersham Pharmacia) using 
the Bioprime Labeling kit (Lift Technologies, Inc.). Hybridization (14, 1 5) and 
posthybridization washes (13) were done as described. For the expression 
analyses, a standard reference (Universal Human Reference RNA; Stratagene, 
La Jolla,' CA) was used in all experiments. Forty jig of reference RNA were 
labeled with Cy3-dUTP and 3.5 fig of test mRNA with Cy5-dUTP, and the 
labeled cONAs were hybridized on microarrays as described (13, 15). For both 
microarray analyses, a laser confocal scanner (Agilent Technologies, Palo 
Alto, CA) was used to measure the fluorescence intensities at the target 
locations using the DEARRAY software (16). After background subtraction, 
average intensities at each clone in the test hybridization were divided by the 
average intensity of the corresponding clone in the control hybridization. For 
the copy number analysis, the ratios were normalized on the basis of the 
distribution of ratios of all targets on the array and for the expression analysis 
on the basis of 88 housekeeping genes, which were spotted four times onto the 
array. Low quality measurements (/.e., copy number data with mean reference 
intensity <100 fluorescent units, and expression data with both test and 
reference intensity <100 fluorescent units and/or with' spot size <50 units) 



were excluded from the analysis and were treated as missing values. The 
distributions of fluorescence ratios were used to define outpoints for increased/ 
decreased copy number.. Genes with CGH ratio >1,43 (representing the upper 
5% of the CGH ratios across all experiments) were considered to be amplified, 
and genes with ratio <0;73 (representing the lower 5%) were considered to be 
deleted.' 

Statistical Analysis of CGH and cDNA Microarray Data. To evaluate 
the influence of copy number alterations on gene expression, we applied the 
following statistical approach. CGH and cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were 
median centered. For each gene, the CGH data were represented by a vector 
that was labeled 1 for amplification (ratio, >1.43) and 0 for no amplification. 
Amplification was correlated with gene expression using the signal-tortoise 
statistics (1). We calculated a weight, for each gene as follows: 

nig, - mgo 



where m gU o* gl and m^, cr^ denote the means and SDs for the expression 
■ levels for amplified and nonamplified cell lines, respectively, to assess the 
statistical significance of each weight, we performed i 0,000 random permu- 
tations of the label vector. The probability that a gene had a larger or equal 
weight by random permutation than the original weight was denoted by a. A 
low a (<0.05) indicates a strong association between gene expression and 
amplification. 

Genomic Localization of cDNA Clones and Amptlcon Mapping. Each 
cDNA clone on the microarray was assigned to a Unigene cluster using the 
Unigene Build 141,* A database of genomic sequence alignment information 
for mRNA sequences was created from the August 2001 freeze of the Uni- 
versity of California Santa Cruz's GoldenPath database. 7 The chromosome and 
bp positions for each cDNA clone were then retrieved by relating these data 
sets. Amp] icons were denned as a CGH copy number ratio >2.0 in at least two 
adjacent clones in two or more cell lines or a CGH ratio >2.0 in at least three 
adjacent clones in a single cell line. The amplicon start and end positions were 



0 Internet' address: http://rc4caJcb.tAgri.nfo 
7 Internet address: www.gcnome.ucsc.edu. 
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Table I Summary qf independent ampi icons In 14 breast cancer cell lines by 
CGH microarray j 



Location 
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17a21.32~o2I.33 
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33 
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69.70 


5.9. 


.. I7q23.3-q24.3 


69.93 


74.99 


5.1 


I9ql3 


40.63 


41.40 


0.8 


. 20qllJ2 
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1.6 
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3.0 
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5U2 
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extended to include neighboring nonamplified clones (ratio, <1.5). The am- 
plicon size determination was partially dependent on local clone density. 

FISH. Dual-color interphase FISH to breast cancer cell lines was done as 
described (17). Bacterial artificial chromosome clone RP11-361K8 was la- 
beled with SpectrurnOmnge (Vysis, Downers Grove, IL), and Spectrum- 
Orange-labeled probe for EGFR was obtained from Vysis. SpectrumGrecn- 
labeled chromosome 7 and 17 centromere probes (Vysis) were used as a 
reference. A tissue microarray containing 612 formalin-fixed, paraffin-embed- 
ded primary breast cancers (17) was applied in FISH analyses as described 
(1 8). the use of these specimens was approved by the. Ethics Committee of the 
University of Basel and by the NIH. Specimens containing a 2-fold or higher 
increase in the number of test probe signals, as compared with corresponding 
centromere, signals, in at least 10% of the tumor cells were considered to be 
amplified. Survival analysis was performed using the Kaplan-Meier method 
. and the log-rank test. 

RT-PCR. the HOXB7 expression level was determined relative to 
GAPDH. Reverse transcription and PCR amplification were performed using 
Access. RT-PCR System (Promega Corp., Madison, WQ with 10 ng of mRNA 
as a template. HOXB7 primers were 5'-OAO^GA(K3GACT(X3GACTT-3' 
and 5'-GCGTCAGOTAGCGATTGTAG-3'. 

RESULTS 

Global Effect of Copy Number on Gene Expression. 13,824 
arrayed cDNA clones were applied. for analysis of gene expression 
and gene copy number (CGH microarrays) in 14 breast cancer cell 
lines. The results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2.5). were overexpressed (/.e., belonged to 
the global upper 7% of expression ratios), compared with only 6% for 
genes with normal copy number levels (Fig. 1A). Conversely, 10.5% 
of the transcripts with high-level expression (cDNA ratio, >10) 
showed increased copy number (Fig. 12?). Low-level copy number 
increases and decreases were also associated with similar, although 
less dramatic, outcomes on gene expression (Fig. 1). 

Identification of Distinct Breast Cancer Amplicons. Base-pair 
locations obtained for 1 1,994 cDNAs (86.8%) were used to plot copy 
number changes as a function of genomic position (Fig. 2, Supple- 
ment Fig. A). The average spacing of clones throughout" the genome 
was 267 kb. This high-resolution mapping identified 24 independent 
breast cancer amplicons, spanning from 0.2 to 12 Mb of DNA (Table 
1). Several amplification sites detected previously by chromosomal 



CGH Were validated, with.lq21, 17ql2-q21.2, 17q22-q23, 20*13.1, 
and 20ql3.2 regions being most commonly amplified, furthermore, 
the boundaries of these amplicons were precisely delineated; In ad- 
dition, novel amplicons were identified at 9pl3 (38.65-39.25 Mb); 
and 17q2L3 (52.47-55.80 Mb). 

Direct Identification of Putative Amplification Target Genes, 
The cDKA/CGH microarray technique enables the direct correla- 
tion of copy number and expression data on a gene-by-gene basis 
throughout the genome. We directly annotated high-resolution 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified genes in the MCF-7 breast cancer 
cell line at lpl3, 17q22-q23, and 20ql3 were highly overex- 
pressed. A view of chromosome 7 in the MDA-468 cell line 
implicates EGFR as the most highly overexpressed and amplified 
gene at 7pi l-pl2 (Fig. $A). In BT-474, the two known amplicons 
at 17ql2 and 17q22-q23 contained numerous highly overex- 
pressed genes. (Fig. 3B). In addition, several genes, including the 
homeobox genes HOXB2 and HOXB7, were highly amplified in a 
previously undescribed independent amplicon at I7q2I,3. HOXB7 
was systematically amplified (as validated by FISH, Fig. 35,. Inset) 
as well as overexpressed (as verified by RT-PCR, data not shown) 
in BT-474, UACC8 12, and ZR-75-30 cells. Furthermore, this novel 




Fig. 3. Annotation of gene expression date on CGH microarray profiles. A, genes in the 
7pl )-pl2 amplicon in the MDA-468 eel) line are highly expressed (red dots) and include 
the EGFR oncogene. B, several genes in the I7ql2, I7q2l3, and 17q23 amplicons in the 
BT-474 breast cancer ceil line are highly overexpressed {red) and include the HOXB7 
gene. The data labels and color coding arc as indicated for Fig. 2C Insets show 
chromosomal CGH profiles for the corresponding chromosomes and validation of the 
increased copy number by interphase FISH using EGFR {red} and chromosome 7 
centromere probe (green) to MDA-468 (A) and /fOB97-«pccifIc probe (red) and chro- 
mosome 17 centromere (green) to BT-474 cells (B). 
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Fig. 4. List of 50 genes with a statistically 
significant correlation (a value <0.05) between 
gene copy number and gene expression. Name, 
chromosomal location, and the a value for each 
gene arc indicated. The genes have been ordered 
according to their position in the genome. The color 
maps on the right illustrate the copy number and 
expression ratio patterns in the 14 cell lines. The 
key to the color code is shown at the bottom of the 
graph. Gray squares, missing values. The complete 
list of 270 genes is shown in supplemental Fig. B. 
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amplification wfts validated to be present jn 10.2% of 363 primary 
breast cancers by FISH to a tissue microarray and was associated 
with poor prognosis of the patients (P = 0.001). 

Statistical Identification and Characterization of 270 Highly 
Expressed Genes in Ampiicons. Statistical comparison of expres- 
sion levels of all genes as a function of gene amplification identified 
270 genes whose expression was significantly influenced by copy 
number across all 14 cell lines (Fig, 4, Supplemental Fig. B). Accord- 
ing to the gene ontology data, 8 91 of the 270 genes represented 
hypothetical proteins or genes with no functional annotation, whereas 
179 had associated functional information available. Of these, 151 
(84%) are implicated in apoptosis, cell proliferation, signal transduc- 
tion, and transcription, whereas 28 (16%) had functional annotations 
that could not be directly linked with cancer. 



DISCUSSION 

The importance of recurrent gene and chromosome copy number 
changes in the development and progression of solid tumors has been 
characterized in >1000 publications applying CGH 9 (9, 10), as well 
as in a large number of other molecular cytogenetic, cytogenetic, and 
molecular genetic studies. The effects of these somatic genetic 
changes on gene expression levels have remained largely unknown, 
although a few studies have explored gene expression changes occur- 
ring in specific ampiicons (15, 19-21). Here, we applied genome- 
wide cDNA microarrays to identify transcripts whose expression 
changes were attributable to underlying gene copy number alterations 
in breast cancer. 

The overall impact of copy number on gene expression patterns was 
substantial with the most dramatic effects seen in the case of higb- 



* Internet address: http:ZAvww.gcTieontology.org/. 
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level copy number increase. Low-level copy number gains and losses 
also had a significant influence on expression levels of genes' in the 
regions affected, but these effects were more subtle on a gene-by-gene 
basis than those of high-level amplifications. However, the impact of 
low-level gains on the dysregulation of gene expression patterns in 
cancer may be equally important if not more important than that of 
high-level amplifications. Aneuploidy and low-level gains and losses 
of chromosomal arms represent the most common types of genetic 
alterations in breast and other cancers and, therefore, have an influ- 
ence on many genes. Our results in breast cancer extend the recent 
studies on the impact of aneuploidy on global gene expression pat-, 
terns in ye*st cells, acute myeloid leukemia, and a prostate cancer 
model system (22-24). 

The CGH microarray analysis identified 24 independent breast 
cancer amplicons. We defined the precise boundaries for many am- 
plicons detected previously by chromosomal CGH (9, 10, 25, 26) and 
also discovered novel amplicqns that had not been detected previ- 
ously, presumably because of their small size (only 1-2 Mb) or close 
proximity to other larger amplicons. One of these novel amplicons 
, involved the homeobox gene region at 17q21 3 and led to the over- 
expression of the HOXB7 and HOXB2 genes. The homeodomain 
transcription factors are known to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 transfection induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
tumorigenicity and angiogenesis in breast cancer (29-32). The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexpressing HOXB7 in breast cancer and suggest that 
HOXB7 contributes to tumor progression and confers an aggressive 
. disease phenotype in breast cancer. This view is supported by our 
rinding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification with poor prognosis 
of the patients. • v . 

We carried out a systematic search to identify genes whose 
expression levels across all 14 cell lines were attributable to 
\ amplification status. Statistical analysis revealed 270 such genes 
• (representing ~2% of all genes on the array), including not only 
previously described amplified genes, such as HER-2 9 MYC % 
EGFR, ribosomal protein s6 kinase, and ALB3, but also numerous 
novel genes such as NRAS-related gene (lpl3), syndecan-2 (8q22), 
and bone morphogenic protein (20ql3.1), whose activation by 
amplification, may similarly promote breast cancer progression. 
Most of the 270 genes have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of them to be causally 
involved, it is intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biological insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

In summary, we. demonstrate application of oDNA microarrays 
to the analysis of both copy number and expression levels of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: (a) evidence of "a 
m y prominent global influence of copy number changes on gene 
expression levels; (6) a high-resolution map of 24 independent 
amplicons in breast cancer; and (c) identification of a set. of 270 
genes, the overexpression of which was statistically attributable to 
gene amplification. Characterization of a novel amplicon at 
17q21.3 implicated amplification and overexpression of the 
HOXB7 gene in breast, cancer, including a clinical association 
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between HOXB7 amplification and poor patient prognosis. Overall, 
our results illustrate how the identification of genes activated by 
gene amplification provides a powerful approach to highlight 
genes with an important role in cancer as well as to prioritize and 
validate putative targets for therapy development 
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Microarray analysis reveals a major direct role of 
DNA copy number alteration in the transcriptional 
program of human breast tumors 

Jonathan R. Pollack**, Therese Sorlle 8 , Charles M. Perou". Christian A. Rees«*» Stefanie S. Jeffrey" Per E. Lonnlna** 
Robert Tibshiraniss, David BotstelnS. Anne-Use Bcrresen-Dale*. and Patrick O. Brown™ 

Contributed by Patrick O. Brown, August 6, 2002 



Genomic DNA copy number alterations are key genetic events in 
the development and progression of human cancers. Here we 
report a genome-wide microarray comparative genomic hybrid- 
ization (array CGH) analysis of DNA copy number variation in 
a series of primary human breast tumors. We have profiled DNA 
copy number alteration across 6,691 mapped human genes, in 44 
predominantly advanced, primary breast tumors and 10 breast 
cancer 'cell lines. While the overall patterns of DNA amplification 
and deletion corroborate previous cytogenetic studies, the high- 
resolution (gene-by-gene) mapping of amplicon boundaries and 
the quantitative analysis of amplicon shape provide significant 
Improvement in the localization of . candidate oncogenes. Parallel 
microarray measurements of mRNA levels reveal the remarkable 
degree to which variation in gene copy number contributes to 
variation in gene expression in tumor cells. Specifically, we find 
that 62% of highly amplified genes show moderately or highly 
' elevated expression, that DNA copy number influences gene ex- 
presslon across a wide range of DNA copy number alterations 
(deletion, low-, mid- and high-level amplification), that on average, 
a 2-fold change in DNA copy number Is associated with a corre- 
sponding 1.5-fold change in mRNA levels, and that overall, at least 
12% of all the variation in gene expression among the breast 
tumors Is directly attributable to underlying variation in ge.ne copy 
number. These findings provide evidence that widespread DNA 
copy number alteration can lead directly to global deregulation of 
. gene expression, which may contribute to the development or 
progression of cancer. 

Conventional cytogenetic techniques, including comparative . 
genomic hybridization (CGH) (l) f have led to. the identifi- 
cation of a number of recurrent regions of DNA copy number 
alteration in breast, cancer cell lines and tumors (2-4). While 
some of these regions contain known or candidate oncogenes 
[e,g^ FGFR1 (8pll), MYC (8q24), CCND1 (Uql3), ERBB2 
<17ql2), and 2NF217 (20ql3)J and tumor suppressor genes 
[RBI (13ql4) and TP53 (17pl3)], the relevant gene(s) within 
other regions (e.g., gain of lq, 8q22, and 17q22-24, and loss of 
8p) remain to be identified. A high-resolution genome-wide 
map, delineating the boundaries of DNA copy number alter- 
ations in tumors, should facilitate the localization and identifi- 
cation of oncogenes and tumor suppressor genes in breast 
cancer. In this study, we have created such a map, using 
array-based CGH (5-7) to profile DNA copy number alteration 
in a series, of breast cancer cell lines and primary tumors. 

An unresolved question is the extent to which the widespread 
DNA copy number changes that we and others have identified 
in breast tumors alter expression of genes within involved 
regions. Because we had measured mRNA levels in parallel in 
the same samples (8), using the same DNA microarrays, we had 
an opportunity to explore on a genomic scale the relationship 
between DNA copy number changes and gene expression. From 



this analysis, we have identified a significant impact of wide- 
spread DNA copy number alteration on the transcriptional 
programs of breast tumors. 

Materials and Methods . 

Tumors and Cell Lines. Primary breast tumors were predominantly 
large (>3 cm), intermediate-grade, infiltrating ductal carcino- 
mas, with more than 50% being lymph node positive, the 
fraction of tumor cells within specimens averaged at least 50%. 
Details of individual tumors have been published (8, 9), and 
are summarized in Table 1, which is published as supporting 
information on the PNAS web site; www.pnas.org. Breast cancer 
cell lines were obtained from the American Type Culture 
Collection. Genomic DNA was isolated either using Qiagen 
genomic DNA columns, or by phenol/chloroform extraction 
followed by ethanol precipitation. 

DNA Labeling and Microarray Hybridizations. Genomic DNA label- 
ing and hybridizations were performed essentially as described 
in Pollack et at. (7), with slight modifications. Two micrograms 
of DNA was labeled in a total volume of 50 microliters and the 
volumes of all reagents were adjusted accordingly, 'Test" DNA 
(from tumors and cell lines) was fluorescently labeled (Cy5) and 
hybridized to a human cDNA microarray containing 6,691 
different mapped human genes (i.e., UniGene clusters). The 
"reference" (labeled with Cy3) for. each hybridization was nor- 
mal female leukocyte DNA from a single donor. The fabrication 
of cDNA microarrays and the labeling and hybridization of 
mRNA samples have been described (8). 

Data Analysis and Map Positions. Hybridized arrays were scanned 
on a GenePix scanner (Axon Instruments, Foster City, CA), and 
fluorescence ratios (test/reference) calculated using scanalyze 
software (available at http://rana.lbl.gov). Fluorescence ratios 
were normalized for each array by setting the average log 
fluorescence ratio for all array elements equal to 0. Measure- 
ments with fluorescence intensities more than 20% above back- 
ground were considered reliable. DNA copy number profiles 
that deviated significantly from background ratios measured in 
normal genomic DNA control hybridizations were interpreted as 
evidence of real DNA copy number alteration (see Estimating 
Significance of Altered Fluorescence Ratios in the supporting 
information). When indicated, DNA copy number profiles are 
displayed as a moving average (symmetric 5 r nearest neighbors). 
Map positions for arrayed human cDNAs were assigned by 
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Flg.1. Genome-wide measurement of DNA copy number aKeratlon by array 

numbers iof X chromosomes, for breast cancer cell lines, and for breast tumors. Each row represents a different cell line or.tumor, and each column represents 
oneof 6.691 different mappedhuman genes present on the microarray, ordered by genome ma 

5-nearest neighbors) fluorescence ratios (test/reference) are depicted using a logrbased pseudocolor scale (Indicated), such that red luminescence reflects 
fold-amplfficatlon, green luminescence reflects fold-deletion, and black Indicates ho change (gray Indicates poorly measured data). (6) Enlarged view of DNA 
copy number profiles across the X chromosome, shown fdr-cell fines containing different numbers of X chromosomes. 



identifying the starting position of the best and longest match of 
any DNA sequence represented in the corresponding UniGene 
cluster (10) against the "Golden Path" genome assembly 
(http://genome.ucsc,edu/; Oct 7, 2000 Freeze). For UniGene 
./Clusters represented by multiple arrayed elements, mean fluo- 
rescence ratios (for all elements representing the same UniGene 
cluster) are reported. For mRNA measurements, fluorescence 
ratios are "mean-centered" (i.e., reported relative to the mean 
ratio across the 44 tumor samples). The data set described here 
can be accessed in its entirety in the supporting information. 

Results 

We performed CGH on 44 predominantly locally advanced, 
primary breast tumors and 10 breast cancer cell lines, using 
cDNA microarrays containing 6,691 different mapped human 
genes (Fig. la; also see Materials and Methods for details of 
microarray hybridizations). To take full advantage of the im- 
proved spatial resolution of array CGH, we ordered (fluores- 
cence ratios for) the 6,691 cDNAs according to the "Golden 
Path" (http://genome.ucsc.edu/) genome assembly of the draft 
human genome sequences (11). In so doing, arrayed cDNAs not 
only themselves represent genes of potential interest (e.g., 
candidate oncogenes within amplicons), but also provide precise 
genetic landmarks for chromosomal regions of amplification and 



deletion. Parallel analysis of DNA from cell lines containing 
different numbers of X chromosomes (Fig. lb), as we did before 
(7), demonstrated the sensitivity of our method to detect single- 
copy loss (45, XO), and 15- (47,XXX), 2- (48>XXXX), or 
2.5-fold (49PCXXXX) gains (also see Fig. 5, which is published 
as supporting information on the PNAS web site). Fluorescence 
ratios were linearly proportional to copy number ratios, which 
were slightly underestimated, in agreement with previous ob- 
servations (7). Numerous DNA copy number alterations were 
evident in both the breast cancer cell lines and primary tumors 
(Fig. la), detected in the tumors despite the presence of euploid 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower in the tumor samples. DNA copy-number 
alterations were found in every cancer cell line and tumor, and 
on every human chromosome in at least one sample.' Recurrent 
regions of DNA copy number gain and loss were readily iden- 
tifiable. For example, gains within lq, 8q, 17q, and 20q were 
observed in a high proportion of breast cancer cell lines/tumors 
(909&/69%, 100%/47%, 100%/60%, and 90%/44%, respective- 
ly), as . were losses within Ip, 3p, 8p, and 13q (80%/24%, 
80%/22%, 8095/22%, and 70%/18%, respectively), consistent 
with published cytogenetic studies (rets. 2-4; a complete listing 
of gains/losses is provided in Tables 2 and 3, which are published 
as supporting information oh the PNAS web site). The total 
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Fig.2. DNA copy number alteration across chromosome 8 by array C6H. (a) DNA copy number profiles are illustrated for cell lines containing different number* 
} of X chromosomes, for breast cancer eel! lines, and for breast tumors. Breast cancer cell lines and tumors are separately ordered by hierarchical clustering to 
highlight recurrent copy number changes. The 241 genes present on the mlcroarrays and mapping to chromosome 8 are ordered by position along the 
chromosome. Fluorescence ratios (test/reference) are depicted by a log 2 pseudocolor scale (Indicated). Selected genes are Indicated wHhcotor-coded text (red, 
increased; green, decreased; black, no change; gray, not well measured) to reflect correspondingly altered mRNA levels (observed In the majority of the subset 
of samples displaying the DNA copy number change). The map positions for genes of Interest that are not represented on the mlcroarray are Indicated In the 
row above those genes represented on the array. (6) graphical display of DNA copy number profile for breast cancer cell line SKBR3. Fluorescence ratios 
(tumor/normal) are plotted on a log 2 scale for chromosome 8 genes, ordered along the chromosome. 



number of genomic alterations .(gains and losses) was found to 
be sign ificantly higher in breast tumors that were high grade (P = 
O.008), consistent with published CGH data (3), estrogen recep- 
tor negative (P » 0.04), and harboring TP53 mutations (P - 
0,0006) (see Table 4, which is published as supporting informa- 
tion on the PNAS web site). 

The improved spatial resolution of our array CGH analysis is 
illustrated for chromosome 8, which displayed extensive DNA 
copy number alteration^in our. series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously ■ 
uncharacterized oncogene (Fig. 2a). The complexity of amplicon 
structure is most easily appreciated in the breast cancer cell line 
SKBR3. Although a conventional CGH analysis of 8q in SKBR3 
identified only two distinct regions of amplification (12), we 
observed .three, distinct regions of high-level amplification (la- 
beled 1-3 in Fig. 2b). For each of these regions we can define the 



boundaries of the interval recurrently amplified in the tumors we 
examined; in each case, known or plausible candidate oncogenes 
can be identified (a description of these regions, as well as the 
recurrently amplified regions on chromosomes 17 and 20, can be 
found in Figs. 6 and 7, which are published as supporting 
information on the PNAS web site); 

For a subset of breast cancer cell lines and tumors (4 and 37, 
respectively), and a subset of arrayed genes (6,095), mRNA 
levels were quantitatively measured in parallel oy using cDNA 
microarrays (8). The parallel assessment of mRNA levels is 
useful" in the interpretation of DNA copy number changes. For 
example, the highly amplified genes that are also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
con. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides.us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor cells. 

A strong influence of DNA copy number on gene expression 
is evident m an examination of the pseudocolor representations 
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-Fig. 3. Concordance between DNA copy number and gene expression across chromosome 17. DNA copy number alteration (Upper) and mRNA levels (tower) 
are Illustrated for breast cancer cell lines and tumors. Breast cancer cell lines and tumors are separately ordered by hierarchical clustering (Upper), and the 
identical sample order Is maintained (Lower)* The 354 genes present on the mlcroarrays and mapping to chromosome 1 7, and for which both DNA copy number 
and mRNA levels were determined, are ordered by position along the chromosome; selected genes are Indicated In color-coded text (see Fig. 2 legend), 
Fluorescence ratios (test/reference) are depicted by separate loga pseudocolor scales (Indicated). 



of DNA copy number and mRNA levels for genes on chromo- 
some 17 (Fig. 3). The overall patterns of gene, amplification and 
elevated gene expression are quite concordant; i.e., a significant 
fraction of highly amplified genes appear to be correspondingly 
highly expressed. The concordance between high-level amplifi- 
cation and increased gene expression is not restricted to chro- 
mosome 17. Genome-wide, of 117 high-level DNA amplifica- 
tions (fluorescence ratios >4, and representing 91 different 
genes), 62% (representing 54 different genes; see Table 5, which 
is published as supporting information on the PNAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-entered fluorescence ratios >2), and 42% (rep- 
resenting 36 different genes) are found associated with compa- 
rably highly elevated mRNA levels (mean-centered fluorescence 
ratios >4). 

. To determine the extent to which DNA deletion arid lower- 
level amplification (in addition to high-level amplification) are 
also associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 cell lines and 37 tumors, across 6,095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low-, 
medium-, and high-level amplification (Fig. 4o). For both the 



breast cancer cell lines and tumors, average mRNA levels 
tracked with DNA copy number across all five classes, in a 
statistically significant fashion (P values for pair-wise Student's 
t tests comparing adjacent classes: cell lines, 4 X 10~* 9 , 1 x 10~ 49 , 
5 x 10- 5 , 1 X lO" 2 ; tumors, 1 X 10"- 43 , 1 X KT 214 , 5 x lfj- 41 , 
1 X 1(T 4 ). A linear regression of the average log(DNA copy 
number), for each class, against average log(mRNA level) 
demonstrated that on average, a 2-fold change in DNA copy 
number was accompanied by 1.4- and 1.5-fold changes in mRNA . 
level for the breast cancer cell lines and tumors, respectively (Fig. 
4a, regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy num- 
ber and mRNA level, each across the 37 tumor samples (Fig. 46). 
The distribution of correlations forms a. normal-shaped curve, 
but with the peak markedly shifted in the positive direction from 
zero. This shift is statistically significant, as evidenced in a plot 
of observed vs. expected correlations (Fig. 4c), and reflects a 
pervasive global influence of DNA copy number alterations on 
gene expression. Notably, the highest correlations between DNA 
copy number and mRNA level (the right tail of the distribution 
in Fig. 4b) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation measured in mRNA levels among the 37 
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Fig. .4. Genome-wfde Influence of DNA copy number alterations on mRNA levels, (a) For breast cancer cell lines (gray) and tumor samples' (black), both 
mean-centered mRNA fluorescence ratio (loga scale) quartlles (box plots Indicate 25th, 50th, and 75th percentile) and averages (diamonds; V-value error bars 
Indicate standard errors of the mean) are plotted for each of five classes of genes, representing DNA deletion (tumor/normal ratio < 0.8), no change (0.8-1.2), 
low- (1.2-2), medium- (2-4), and high-level (>4) amplification. P values for pair-wise Student's t tests, comparing averages between Bdjacent dasses (moving 
lefttoriflht),are4X 10~«. 1 x 10-",5 x 10~* i x 10- 2 (ceil lines), and 1 X tOT 41 , 1 x 10- II4 .5 x 10"* 1 , 1 x 10"« (tumors). (6) Distribution of correlations between 
DNA copy number and mRNA levels, for 6,095 different human genes across 37 breast tumor samples, (c) Plot of observed versus expected correlation coefficients. 
The expected values were obtained by randpmization of the sample labels in the DNA copy number data set. The line of unity Is indicated. <d) Percent variance 
In gene expression (among tumors) directly explained by variation In gene copy number. Percent variance explained (black line) arid fraction of data retained 
(gray fine) are plotted for different fluorescence intensity/background (a rough surrogate for signal/noise) cutoff values. Fraction of data retained is relative 
to the 1.2 Intensity/background cutoff. Details of the linear regression model used to estimate the fraction of variation In gene expression attributable to 
underlying DNA copy number alteration can be found In the supporting Information (see estimating the Fraction of Variation In Gene Expression Attributable 
to Underlying DNA Copy Number Alteration). 



tumors that could be attributed to underlying variation in DNA 
copy number. From this analysis, we estimate that, overall, about 
7% of ail of the observed variation in mRNA levels can be 
explained directly by variation in copy number of the altered 
genes (Fig. <W). We can reduce the effects of experimental 
measurement error on this estimate by using only that fraction ' 
of the data most, reliably measured (fluorescence intensity/ 
background >3); using that data, our estimate of the percent 
variation in mRNA levels directly attributed to variation in gene 
copy number increases to 12% (Fig. Ad). This still undoubtedly 
represent* a significant underestimate, as the observed variation 
in global gene expression is affected not oniy by true variation in 
the expression programs of the tumor cells themselves, but also 
by the variable presence of non-tumor cell types within clinical 
samples. 

Discussion 

This genome-wide, array CGH analysis of DNA copy number 
alteration in a series of human breast tumors demonstrates the 
usefulness of defining amplicon boundaries at high resolution 
(gene-by-gene), and quantitatively measuring amplicon shape, to 
assist in locating and identifying candidate oncogenes. By ana- 
lyzing mRNA levels in parallel, we have also discovered that 
changes in J)NA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 



"cell lines and tumors. Although the DNA microarrays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we are examining such a large traction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we are likely underestimating the 
contribution of DNA copy number changes to altered gene 
expression, we believe our findings are likely to be generalizable 
(but would nevertheless still be remarkable if only applicable to 
this set of -6,100 genes). 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13). Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene expression in cancer cells* In 
agreement with our findings, Phillips etaL (14) have shown that 
with the acquisition of tumorigenicity in an immortalized pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a statistically significant respective increase and 
decrease in the average expression level of involved genes. In 
contrast, Platzer et dl (15) recently, reported that in metastatic 
colon tumors only ~~4% of genes within amplified regions were 
found more highly (>2-fold) expressed, when compared with 
normal colonic epithelium. This report differs substantially from 
our finding that 62% of highly amplified genes in breast cancer 
exhibit at least 2-fold increased expression. These contrasting 
findings may reflect methodological differences between the 
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studies. For example, the study of Platzer et aL (15) may have 
systematically under-measured gene expression changes. In this 
regard it is remarkable that only 14 transcripts of many thousand 
residing within unamplirled chromosomal regions were found to 
exhibit at least 4-fofd altered expression in metastatic colon 
cancer Additionally, their reliance on lower-resolution chromo- 
somal GGH may have resulted in poorly delimiting the bound- 
aries of high-complexity amplicons, effectively overcalling re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNA copy number alteration has 
a large, pervasive and direct effect on global gene expression 
patterns in breast cancer has several important implications. 
First, this finding supports a high degree .of copy number- 
. dependent gene expression in tumors. Second, it suggests that 
most genes are not subject to specific auto regulation or dosage 
compensation. Third, this rinding cautions that elevated expres- 
sion of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in tumor* 
jgenesis. In our study,- fully 62% of highly amplified genes 
demonstrated moderately or highly elevated expression. This 
highlights the importance of high-resolution mapping of ampli- 
con boundaries and shape [to identify the "driving** gene(s) 
within amplicons (16)], on a large number of samples, in addition 
to functional studies. Fourth, this rinding suggests that analyzing 
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the genomic distribution of expressed genes, even within existing 
micro array gene expression data sets, may permit the inference 
of DNA copy numoer aberration, particularly aneuploidy (where 
gene expression can be averaged across large chromosomal 
regions; see Fig. 3 and supporting information). Fifth, this 
finding implies that a substantial, portion of the phenotypic 
uniqueness (and . by extension/ the heterogeneity in clinical 
behavior) among patients' tumors may be traceable to underly- 
ing variation in DNA copy number. Sixth, this finding supports, 
a possible role for widespread DNA copy number alteration in 
tumorigenesis (17, 18), beyond the amplification of specific 
oncogenes and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread imbalance in gene expression, might disrupt critical 
stochiometric relationships in cell metabolism and physiology 
(e.g., proteosome, mitotic spindle), possibly promoting further 
chromosomal instability and directly contributing to tumor 
development or progression. Finally, our findings suggest the' 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression in cancer. 
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12. Ashkenazi. A ., and Dixit, V. Apoptosis control by death and decoy receptors. 
Curr. Opin. Cell. Biol. 11, 255-260 (1999). 



9 



13. Ashkenazi A . Chapters on ApolVTRASL; DR4, DR5, DcRl, DcR2; and DcR3. 
Online Cytokine Handbook (www'apnet.com/cvtokinkeferenceA . 

14. . Ashkenazi, A , Targeting death and decoy receptors of the.tunW. necrosis ifector 

superfamily. Nature Rev. Cancer 2, 420-430 (2002). ) ' 
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27. Apoptosis control by death and decoy receptors. Gordon Research Conference on 
Apoptosis, New London, NH, June 1999. 

28. Apoptosis control by death and decoy receptors. Arthritis Foundation Research 
Conference, Alexandria GA, Aug i 999. 
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34. The Apo2L/TRAJL system: therapeutic potential. Cap-CURB summit meeting. 
Santa Monica, CA, June 2000. 
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Bach year, over 182,000 women in the United States are 
diagnosed with breast cancer, and approximately 45,000 die 
of the disease. 1 Incidence appears to he increasing in the 
United States at a rate of roughly 2% per year. The reasons 
for the increase are unclear, but non-genetic risk factors appear 
1o play a large role. 2 

Five-year survival rates range from approximately 65%- 
85%, depending on demographic group, with a significant 
percentage of women experiencing recurrence of their cancer 
within 10 years of diagnosis. One of the factors most predic- 
tive for recurrence once a diagnosis of breast cancer has been 
made is the number of axillary lymph nodes to which tumor 
has metastasized. Most node-positive women are given adju- 
vant therapy, which increases their survival. . However, 20%- 
30% of patients without axillary node involvement also 
develop recurrent disease, and the difficulty lies in how to iden- 
tify this high-risk subset of patients. These patients could 
benefit from increased surveillance, early intervention, and 
treatment. 

Prognostic markers currently used in breast cancer recur- 
rence prediction include tumor size, histological grade, steroid 
hormone receptor status, DNA ploidy, proliferative index, and 
cathepsin D status. Expression of growth factor receptors and 
over-expression of the HER-2/neu oncogene have also been 
identified as having value regarding treatment regimen and 
prognosis. 

HER-2/neu (also known as c-erbB2) is an oncogene that 
encodes a transmembrane glycoprotein that is homologous 
to, but distinct from, the epidermal growth factor receptor. 
Numerous studies have indicated that high levels of expres- 
sion of this protein are associated with rapid tumor growth, 
certain forms of therapy resistance, and shorter disease-free 
survival. The gene has been shown to be amplified and/or 
overexpressed in 10%-30% of invasive breast cancers and in 
40%-60% of intraductal breast carcinoma. 3 

There are two distinct FDA-approved methods by which 
HER-2/neu status can be evaluated: immunohistochemistry 
(1HC, HercepTest™) and FISH (fluorescent in situ hybridiza- 
tion, FathVysion™ Kit). Both methods can be performed on 
archived and current specimens. The first method allows visual 
assessment of the amount of HER-2/neu protein present on 
the cell membrane. The latter method allows direct quantifi- 
cation of the level of gene amplification present in the tumor, 
enabling differentiation between low- versus high-amplifica- 
tioU At least one study has demonstrated a difference in 



recurrence risk im women younger than 40 years of age for 
low-, versus high-amplified tumors (54.5% compared to 
85.7%); this is compared to a recurrence rate of 16.7% for 
patients with no HER-2/neu gene amplification. 4 HER-2/neu 
status may be particularly important to establish in women with 
small (£ l em) tumor size, . 

the choice of methodology for determination of HER-2/ 
neu status depends in part on the clinical setting. FDA approval, 
for the Vysis FISH test was granted based on clinical trials 
involving 1 1549 node-positive patients. Patients received one 
of three different treatments consisting. of different doses of 
cyclophosphamide, Adriamycin, and 5-fluorouracH (CAF). 
The study showed that patients with amplified HER-2/neu 
benefited from treatment with higher doses of adriamycin- 
based therapy, while those with normal HER-2/neu levels did 
not. The study , therefore identified a sub-set of women, who 
because they did not benefit from more aggressive treatment, 
did not need to be exposed to the associated side effects. In 
addition, other evidence indicates that HER-2/neu amplifica- 
tion in node-negative patients can be used as an independent 
prognostic indicator for early recurrence, recurrent disease at 
any time and disease-related death. 5 -Demonstration of HER- 
2/neu gene amplification by FISH has also been shown to be 
of value in predicting response to chemotherapy in stage-2 
breast cancer patients. 

Selection of patients for Herceptm 0 (Trastuzumab) mono- 
clonal antibody therapy, however, is based upon demonstra- 
tion of HER-2/neu protein overexpression using HercepTest™. 
Studies using Herceptin 0 in patients with metastatic breast 
cancer show an increase in time to disease progression, 
increased response rate to chemotherapeutic agents and a small 
increase in overall survival rate. The FISH assays have not yet 
been approved for this purpose, arid studies looking 'at response 
to Herceptin 0 in patients with or without gene amplification 
status determined by FISH are in progress. 

In genera], FISH and IHC results correlate well. However, 
subsets of tumors are found which show discordant results; 
i.e.,protetn overexpression without gene amplification or lack 
of protein overexpression with gene amplification. The clini- 
cal significance of such results is unclear. Based on the above 
considerations, HER-2/neu testing at SHMC/PAML will uti- 
lize immunohistochemistry (HercepTest 0 ) as a screen, fol- 
lowed by FISH in IHC-negatiye cases. Alternatively, either 
method may be ordered individually depending on the clini- 
cal setting or clinician preference. 
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CPT code information 

/. 

HER-2/aeu via IHC 
88342 (including interpretive report) 

HEK-2/neu via FISH 

8827 1 x2 Molecular cytogenetics, DNA probe, each 
88274 Molecular cytogenetics, interphase in situ hybrid- 
ization, analyze 25-99 cells 
8829 1 Cytogenetics and molecular cytogenetics, interpre- 
tation and, report 

Procedural Information 

Imraunohislochemistry is performed using tile FDA-approved 
DAKO antibody kit, Herceptcst 0 . The DAKO kit contains 
reagents required to complete a two-step immnnohisto- 
chemical staining procedure forroutinely processed, paraffin- 
embedded specimens. Following incubation with the primary 
rabbit antibody to human HER-2Aieu protein, the kit employs 
a ready-to^se dextran-based visualization reagent. This re- 
agent consists of both secondary goat anti-rabbit antibody 
molecules with horseradish peroxidase molecules linked to a 
common dextrao polymer backbone, thus eliminating the need 
for sequential application of link antibody and peroxidase 
conjugated antibody. Enzymatic conversion of the subse- 
quently added chromogen results in formation of visible 
reaction product at the antigen site; The specimen is then coun- 
terstained; a pathologist using light-microscopy interprets 
• results. 

FISH analysis at SHMC/PAML is performed using the 
FDA-approved Path Vysion™ HBR-2/neu DNA probe kit, pro- 
duced by Vysis, Inc. Formalin fixed, parafBn-embedded breast 
tissue is processed using routine histological methods, and then 
slides are treated to allow hybridization of DNA probes to the 
nuclei present in the tissue section. ThePathvysion™ kit con- 
tains two direct-labeled DNA probes, one specific for the 
alphoid repetitive DNA (CEP 1 7, spectrum orange) present at 
the chromosome 17 centromere and the second for the HER- 
2/neu oncogene located at 17ql 1.2-12 (spectrum green). Enu- 
meration of the probes allows a ratio of the number of copies 
of chromosome 1 7 to the number of copies of HER-2/neu to 
be obtained; this enables quantification of low versus high 
amplification levels, and allows an estimate of the percentage 
of cells with HER-2/heu gene amplification. The clinically 
relevant distinction is whether the gene amplification is due 
to increased gene copy number on the two chromosome 17 
. homologues normally present or an increase in the number of 
chromosome 17s in the cells. In the majority of cases, ratio 
equivalents less than 2.0 are indicative of a normal/negative 
result, ratios of 2.1 and over indicate that amplification is 
present and to what degree. Interpretation of this data will be 
performed and reported from the Vysis-certified Cytogenet- 
ics laboratory at SHMC. 
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